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ZTNC FINCER PROTEIN DERIVATIVES AND METHODS THEREFOR 



This eqjplication is a.continuation-in-pait of application Serial No. 08/312,604, filed 
September 28, 1994, \^ch is a continuation-in-part of Eqjplication Serial No. 08/183,1 19, 
filedJanuary 18, 1994. 

5 BACKGROUND OF THE INVENTION 
1. Field of the Invention 

This invention relates generally to the field of regulation of gene expression and 
specifically to methods of modulating gene expression by utilizing polypeptides derived 
fix)m zinc finger-nucleotide binding proteins. 

10 2. Descr^tion of Related Art 

Tiansciiptional regulation is primarily achieved by the sequence-specific binding of 
proteins to DNA and RNA. Of the known protein motifs involved in the sequence 
specific recognition ofDNA,&e zinc finger protein is unique in its modular nature. To 
date, zinc finger protdns have been identified wfaidi contain between 2 and 37 modules. 
15 More than two hiindredim}teins, many ofthemtransoipdon&ctors, have been shown to 
possess zinc fingers domains. Zinc fingers connect transcription fiictors to their target 
genes mainly by binding to specific sequences of DNA base pairs - the "rungs" in Ae 
DNA "ladder". 

Tiac finger modules are qiproximately 30 amino acid-long moti& foimd in a wide variety 
20 oftranscription regulatory proteins in eukaiyotic organisms. As die name implies, this 
nucleic acid binding protein domain is folded around a zinc ion. The zinc finger domain 
was first recognized in the transcription fector TFIIIA firom Xenopus oocytes (Miller, et 
al. EMBO, 4:1609-1614, 1985; Brown, et al.. FEES Lett.. iM:271-274, 1985). This 
protein consists of nine imperfect repeats of a consensus sequence: 



wo 95/19431 



PCTAJS95/00829 



-2- 

(Tyr. Phe)-X-Cys-X2^-Cys-X3-Phe-X5-Leu-Xr.His-Xj^ffis-X2^ (SEQ ID NO: 1) 
where X is any amino acid. 

Like TFIQA, most anc finger proteins have conserved cysteine and histidine residues that 
tetrahedrally-coordinate tiie single zinc atom in each finger domain. The stmcture of 
5 individual zinc finger pq>tides of this type (containing two cysteines and two histidines) 
such as those found in the yeast protein ADRI, tfie human male associated protein ZFY, 
the HTV enhancer protein and the Xenopus protein Xfin have been solved by high 
resolution NMR methods (Kochoyan, et al.. Biochemistry. 2Q:3371-3386, 1991; 
Omichinski, et aL. Biochemistry. 22:9324-9334, 1990; Lee, etal.. Science. 245:635-637, 

10 1989) and detailed models for &e intoaction of zinc fingers and DNA have been 
proposed (Berg, 1988; Berg, 1990; Chuidiill, etal.. 1990). Moreover, the structure of a 
maee finger polypqitide-DNA complex doived Scorn Has mouse immediate eariy protein 
zi^8 (also known as Krox-24) has been solved by x-ray crystallography (Pavletich and 
Pabo, Science, 222:809-817, 1991). Each finger contains an antiparallel p-tum, a finger 

15 tip r^on and a ^ort amphipaftic a-helix 'wiiich, in the case of a£268 zinc fingers, binds 
in tiie major groove of DNA. In addition, ibs conserved hydrophobic amino acids and 
zinc coordination by tiie cysteine and histidine residues stabilize the structure of die 
individual finger domaiiL 

While the prototype ziac finger protein TFIIIA contains an array of nine zinc fingers 
20 vtiiich binds a 43 bp sequence within the SS UNA genes, regulatory proteins of ^zii268 
class (Krox-20, Spl, for example) contain only three zinc fingos vnHbin a much larger 
polypeptide. The diree zinc finga:sofzi£268 each recognize a 3 bp subsite within a 9 bp 
recognition sequence. Most of the DNA contacts made by zi£268 are with phosphates and 
with guanine residues on one DNA strand in the major groove of the DNA helix. In 
25 contrast, the mechanism of TFIIIA binding to DNA is more complex. The amino- 
terminal 3 zinc fingers recognize a 13 bp sequoice and bind in the major groove. Similar 
to af268, these fingers also make guanine contacts primarily on one strand of the DNA. 
Unlike the ziC68 class of proteins, zinc fingers 4 and 6 of TFIIIA each bind either in or 
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across the minor groove, bringing fingers 5 and 7 through 9 back into contact with the 
major groove (Clemens, etal.. Proc. Natl Acad. Sci. USA, 52:10822-10826, 1992). 

Hie crystal structure of ziQ68, indicates that specific histidine (non-zinc coordinating his 
residues) and arginine residues on the sur&ce of the a-helix participate in DNA 
5 recognition. Specifically, the charged amino acids immediately preceding the a-helix and 
at hdix positions 2, 3, and 6 immediately preceding the conserved histidine) participate 
in hydrogen bonding to DNA guanines. Similar to finger 2 of the regulatory protein 
Krox-20 and fingers 1 and 3 of Spl, finger 2 of TFUIA contains histidine and arginine 
residues at these DNA contact positions; further, each of these zinc fingers minimally 

10 recognizes tiie sequence GGG. Finger sw!9)aq)eriments between transoiption&ctor Spl 
and Krox-20 have confirmed the 3-bp zinc finger recognition code for this class of finger 
proteins (Nardelli, et aL. Nature, M2:17S-178, 1989). Mutagenesis aq)eriments have 
also shown tiie inqxjrtance of Aese amino adds in specifying DNA recognitioiL It would 
be desirable to ascertain a siirtple code which specifies anc finger-nucleotide recognition. 

15 If such a code could be deciphered, then zinc finger polypeptides might be designed to 
bind any chosoi DNA sequence. The conq)lex of such a polypeptide and its recognition 
sequence might be utilized to modulate (ysp or down) the transcriptional activity of the 
goie containing this sequoice. 

Zinc finger proteins have also beoi rqx>rted v^ch bind to RNA. Clemens, et aL, 
20 (Science, 2J5Q:530, 1993) found that fingers 4 to 7 of TTTIIA contribute 95% of the fiee 
energy ofTFniA binding to SSrRNA,vdiereas fingers 1 to 3 make a similar contribution 
in binding tiie promote- of tiie 5S gene. Comparison of the two known 5S RNA binding 
proteins, TFIIIA and p43, reveals few homologies other than the consensus zinc ligands 
(C and H), hydrophobic amino acids and a threonine-tryptophan-threonine triplet motif 
25 in finger 6. 

In order to redesign anc fingers, new selective strategies must be developed and 
additional information on the structural basis of sequence-specific nucleotide recognition 
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is required. Current protein engineering efiforts utilize design strategies based on 
sequence and/or structural analogy. While such a strategy may be siifiicient for the 
transfer of motifs, it limits the ability to produce novel nucleotide binding motifs not 
known in nature. Indeed, the redesign of zinc fingers utilizing an analogy based strategy 
5 has met with only modest success (Desjarlais and Berg, Proteins, 12:101, 1992). 

As a consequence, there exists a need for new strategies for designing additional zinc 
fingers with specific recognition sites as well as novel zinc fingers for enhancing or 
repressing gene expression. The present invention fiilfills this need. 
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SUMMARY OF THE INVENTION 

The invention provides an isolated zinc finger-nucleotide binding polypeptide variant 
comprising at least two zinc finger modules that bind to a cellular nucleotide sequence 
and modulate the flmction of the cellular nucleotide sequence. The variant binds to either 
5 DNA or KNA and nmy enhance or suppress transcription finm a promoter or fix)m within 
a structural gene. The cellular nucleotide sequence may be a sequence v^ch is a 
naturally occurring sequence in the cell, or it may be a viral-derived nucleotide sequence 
in the cell. 

In anotho* embodimrait, the invention provides a pharmaceutical composition conqnising 
10 a thenq)eutically effective amount of a zinc fingo'-nucleotide binding polypeptide 
derivative or a tiiereq)eutically effective amount of a nucleotide sequrace yUdch oicodes 
a zinc finger-nucleotide binding polypeptide derivative, '^^lerein the derivative binds to 
a cellular imcleotide seqtience to modulate the fimction of tbs cellular nucleotide 
sequence, in combination with a pharmaceutically acceptable carrier. 

15 In a fiirther embodiment, tiie invention provides a meAod for inhibiting a cellular 
nucleotide sequence conqnising a zinc finger-nucleotide binding moti^ the method 
comprising contacting fbs motif with a anc finger-nucleotide binding polypeptide 
derivative \t^ch binds the motif. 

In yet a fintfaer embodiment, the invoition provides a method for obtaining an isolated 
20 zinc finger-nucleotide binding polypeptide variant v/bich binds to a cellular nucleotide 
sequence comprising identiiying fbs amino acids in a anc finger-nucleotide binding 
polypeptide that bind to a first cellular nucleotide sequence and modiilate the fimction of 
the nucleotide sequ«ice; creating an expression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified; expressing the library 
25 in a suitable host cell; and isolating a clone that produces a polypeptide variant that binds 
to a second cellular nucleotide sequence and modulates the fimction of the second 
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nucleotide sequence. Preferably, the expression library encoding the polypeptide variant 
is a phage display library. 

The invention also provides a method of treating a subject with a cell proliferative 
disorder, wherein the disorder is associated witii the modiilation of gene esqjression 
5 associated with a zinc finger-nucleotide binding motif, comprising contacting the zinc 
finger-nucleotide binding motif with an effective amount of a zinc finger-nucleotide 
binding polypeptide derivative that binds to the zinc finger-nucleotide binding motif to 
modulate activity of the gene. 

Further, the invention provides a method for identifying a protein vAdch modulates the 
1 0 function of a cellular nucleotide sequence and binds to a zinc finger-nucleotide binding 
motif comprising incubating components comprising a nucleotide sequence ouxxling the 
putative modulating protein oporably linked to a first inducible promoter, and a reporter 
geos operably linked to a second inducible promoter and a zinc finger-nucleotide binding 
motif, Mkdierein the incubating is carried out imder conditions sufficient to allow the 
1 5 components to intoact; and measuring the effect of the putative modulating protein on 
the e:q)ression of the reporter gene. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows a model for the interaction of the zinc fingers of TFIIIA with the 
internal promoter of the SS RNA gene. 

FIGURE 2A shows ibe amino acid sequence of the first Hxree amino tenninal zinc fingers 
5 ofTFIIIA. 

FIGURE 2B shows the niicleotide sequence of the minimal binding site for zf 1-3. 

FIGURE 3 shows a gel mobility diift assay for ibe binding of zfl-3 to a 23 bp ^-labeled 
double stranded oligonucleotide. 

FIGURE 4 shows an aiitoradiogram of in vitro transcription indicating diat zfl-3 blocks 
1 0 transcription by T7 RNA polymerase. 

FIGURE 5 diows binding of zfl-3 to its recognition sequence blocks transcription fmm 
a T7RNA polymerase promoter located neaiby . A plot of percent of DNA molecules 
bound by zfl-3 in a gel mobility shift assay (x-axis) is plotted against percent inhibitiori 
of T7RNA polymerase transcription (y-axis). 

1 5 FIGURE 6 is an autoradiogram showing zfl -3 blocks eukaryotic RNA polymerase III 
transcription in an in vitro transcription system draived i&om unfertilized Xenopus eggs. 

FIGURE 7 shows fhe nucleotide and deduced amino acid sequence for the zinc fingers 
of zif268 which were cloned in pComb 3.5. 
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FIGURE 8 shows the amino acid sequence of the Zif268 protein and the hairpin DNA 
used for phage selection. (A) shows the conserved features of each zinc finger. (B) 
shows the hairpin DNA containing the 9-bp consensus binding site. 

FIGURE 9 is a table listing of the six randomized residues of finger 1 , 2, and 3. 

5 FIGURE 1 0 shows an SDS-PAGE of Zif268 variant AM before IPTG induction (lane 2); 
after IPTG induction (lane 3); cytoplasmic fiaction after removal of mclusion bodies (lane 
4); inclusion bodies containing zinc finger peptide (lane S); and mutant Zi£268 (lane 6). 
Lane 1 is MW Standards (kD). 

FIGURE 1 1 is a table indicating association rate; dissociation rate; and 
1 0 equilibrium dissociation constant, for each protein. 

FIGURE 12 shows dissociation rate (A^ of wild-type Zi£268 protein (WT) (O) and its 
variant C7 (o), by real-time changes in sur&ce plasmon resonance. 

FIGURE 13A and B show the nucleotide and amino acid sequence of Zi£268-Jun (SEQ 
ID NOS: 33 and 34). 

1 5 FIGURE 14A and B show the nxicleotide and amino acid sequence of Zi£268-Fos (SEQ 
ID NOS: 35 and 36). 

FIGURE IS shows the nucleotide and amino acid sequence of the three finger 
construction of C7 zinc finger (SEQ ID NOS: 41 and 42). 

FIGURE 16A and B show fbe nucleotide and amino acid sequence of Zi068-Zif268 
20 linked by a TGEKP linker (SEQ ID NOS: 43 and 44). 



SUBSnTUTE SHEET (RULE 26) 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides an isolated zinc finger-hucleotide binding polypeptide 
variant comprising at least two zinc finger modules that bind to a cellular nucleotide 
sequence and modulate the fimction of the cellular nucleotide sequence. The polypq)tide 
5 variant may enhance or suppress transoiption of a gene, and may bind to DNA or RNA. 
In addition, the invention provides a pharmaceutical composition comprising a 
therapeutically effective amount of a zinc finger-nucleotide binding polypeptide 
derivative or a ther^utically effective amount of a nucleotide sequence that encodes a 
zinc finger-nucleotide binding polypqptide draivative, wherein the derivative binds to a 
1 0 cellular nucleotide sequmre to modulate the fimction of the cellular nucleotide sequence, 
in combination with a pharmaceutically acceptable carrier. The invention also provides 
a screening method for obtaining a zinc finger-nucleotide binding polypeptide variant 
which binds to a cellular or viral nucleotide sequence. 

A zinc finger-nucleotide binding polypeptide "variant" refers to a polypeptide ^ch is 
15 a mutagoiized form of a zinc finger protein or one produced through recombination. A 
variant may be a hybrid yMck contains zinc finger domain(s) fix>m one protein linked to 
zinc finger domain(s) of a second protein, for example. The domains may be wild type 
or mutagenized. A "draivative" includes a truncated form of a wild type zinc finger 
protein, which contains less tbsa the original number of fingers in tiie wild type protein. 
20 A derivative also includes variant zinc finger polypeptides. Examples of zinc finger- 
nucleotide binding polypeptides horn vMch a derivative or variant may be produced 
include TFIHA and a£Z68. 

As used herein a "zinc finger-nucleotide binding motif refers to any two or three- 
dimensional feature of a nucleotide segment to which a zinc finger-nucleotide binding 
25 derivative polypeptide binds with specificity. Included within this definition are 
nucleotide sequences, generally of five nucleotides or less, as well as the three 
dimensional aspects of the DNA double helix, such as the major and minor grooves, die 
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face of the helix, and the like. The motif is typically any sequoice of suitable length to 
vMch the zinc finger polypeptide can bind. For example, a three finger polypeptide binds 
to a motif typically having about 9 to about 14 base pairs. Therefore, the invention 
provides zinc finger-nucleotide binding polypeptides of any specificity, and the zinc 
5 finger binding motif can be any sequence designed by the expeiimeat or to vAdch the zinc 
finger protein binds. The motif may be found in any DNA or RN A sequence, including 
regulatory sequences, exons, introns, or any non-coding sequence. 

In the practice of this invention it is not necessary tiiat the zinc finger-nucleotide binding 
motif be known in order to obtain a zinc-finger nucleotide binding variant polypeptide. 

10 Although zinc finger proteins have so fiir been identified only in eukaryotes, it is 
specifically contemplated within tiie scope of Has invention tiiat anc finger-nucleotide 
binding moti& can be identified in non-eukaiyotic DNA or RNA, especially in ^ native 
promoters of bacteria and viruses by the binding thereto of die genetically modified 
isolated constructs of this invention that preserve the well known structural characteristics 

15 of the zinc finger, but differ from one finger proteins found in nature by their method of 
production, as well as their amino acid sequences and three-dimensional structures. 

The characteristic structure of tiie known wild type zinc Gaga proteins are made up of 
fiom two to as many as 37 modular tandem repeats, with each repeat forming a "finger" 
holding a zinc atom in tetrahedral coordination by means of a pair of conserved cysteines 
20 and a pair of conserved histidines. Generally each finger also contains conserved 
hydrophobic amino adds tiiat interact to form a hydrophobic core that helps ths module 
maintain its shape. 

The zinc finger-nucleotide binding polypeptide variant of the invention comprises at least 
two zinc finger modules that bind to a cellular nucleotide sequence and modulate the 
25 fimction of the cellular nucleotide sequence. The term "cellular nucleotide sequence" 
refers to a nucleotide sequence which is present within the cell. It is not necessary that 
the sequence be a naturally occurring sequence of the cell. For example, a retroviral 
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genome vMch is integrated within a host* s cellular DNA, would be considered a "cellular 
nucleotide sequence". The cellular nucleotide sequence can be DNA or RNA and 
includes both introns and exons. The cell and/or cellular nucleotide sequence can be 
prokaiyotic or eukaryotic, including a yeast, virus, or plant nucleotide sequence. 

5 The term "modulate" refers to tiie suppression, enhancement or induction of a function. 
For example, the zinc finger-nucleotide binding polypqrtide variant of Ae invention may 
modulate a promoter sequence by binding to a motif within the promoter, thereby 
enhancing or suppressing transcription of a gene operatively linked to the promoter 
cellular nucleotide sequence. Alternatively, modulation may include inhibition of 
1 0 transcription of a gene v^iere tiie zinc finger-nucleotide binding polypeptide variant binds 
to the stnictural gene and blodcs DNA dqiendent RNA polymerase from reading through 
the gene, thus inhibiting transcription of the gene. The structural gene may be a normal 
cellular graie or an oncogene, for example. 

The promoter region of a gene includes the regulatoiy elements that typically lie 5' to a 
15 structural gene. If a gene is to be activated, proteins known as transaiption &ctors attadi 
to the promoter region of Ae gme. This assembly resembles an "on switch" by enabling 
an oizyme to transaibe a second genetic segment fix>m DNA into RNA. In most cases 
the resulting RNA molecule serves as a template for synthesis of a specific protein; 
sometimes RNA itself is the final product 

The promoter region may be a normal cellular promote or, for example, an onco- 
promoter. An onco-promoter is goierally a virus-derived promoter. For example, the 
long tenninal repeat (LTR) of r^viruses is a promotor region which may be a target for 
a zinc finger binding polypeptide variant of the invention. Promoters fiom members of 
the LerOmrus group, whidi inchide such pathogens as human T-cell lymphotrophic vims 
(HTLV) 1 and 2, or hiiman immunodeficiraicy virus (HTV) 1 or 2, are examples of viral 
promoter regions which rosy be targeted for transcriptional modulation by a zinc finger 
binding polypeptide of the invention. 



20 



25 
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The zinc finger-nucleotide binding polypeptide derivatives or variants of the invention 
include polypeptides that bind to a cellular nucleotide sequence such as DNA, RNA or 
both. A 2jnc finger-nucleotide binding polypeptide which binds to DNA, and specifically, 
the zinc finger domains \^ch bind to DNA, can be readily identified by examination of 
5 the "linker" region between two zinc finger domains. The linker amino acid sequence 
TGEK(P) (SEQ ID NO: 32) is typically indicative of zinc finger domains vsiiich bind to 
a DNA sequence. . Therefore, one can determine whether a particular zinc finger- 
nucleotide binding polypeptide preferably binds to DNA or RNA by examination of the 
linker amino acids. 

10 In one ranbodiment, a method of the invraition includes a method for inhibiting or 
suppressing the function of a cellular nucleotide sequence comprising a zinc finger- 
nucleotide binding motif wiiich comprises contacting the zinc finger-nucleotide binding 
motif witfi an effective amount of a zinc finger-nucleotide binding polypeptide derivative 
that binds to ibe motif. In die case -wbexe Ihe cellular nucleotide sequmce is a promoter, 

15 the method includes inhibiting tiie transcriptional transactivation of a promoter containing 
a zinc finger-DNA binding motif. The terai "inhibiting" refers to the suppression of the 
level of activation of transmption of a structural gene operably linked to a promoter 
containing a zinc finger-nucleotide binding motif, for example. In addition, the zinc 
finger-nucleotide binding polypeptide derivative may bind a motif witiiin a stiuctural gene 

20 or within an RNA sequence. 

Tlie tenn "effective amount" includes fliat amount vAdch results in the deactivation of a 
previously activated promoter or that amount which results in the inactivation of a 
promoter containing a zinc finger-nucleotide binding motif, or that amount v^ch blocks 
transcription of a structural gene or translation of RNA. The amount of zinc finger 
25 derived-nucleotide binding polypeptide required is that amoimt necessary to either 
displace a native zinc finger-nucleotide binding protein in an existing protein/promoter 
complex, or tiiat amount necessary to compete with the native zinc finger-nucleotide 
binding protein to form a complex with the promoter itself. Similarly, the amount 



95/19431 



PCTAJS95/00829 



-13- 

requiied to blodc a structural gene or RN A is that amount vMch binds to and blocks RNA 
polymoxise from reading through on the gene or that amount which inhibits translation, 
respectively. Preferably, the method is praformed intracellularly. By functionally 
inactivating a promoter or structural gene, transcription or translation is suppressed. 
Delivery of an effective amoimt of the inhibitory protein for binding to or "contacting" 
the cellular nucleotide sequence containing the zinc finger-nucleotide binding protein 
motif, can be accomplished by one of the mechanisms described herein, such as by 
retroviral vectors or liposomes, or other methods well known in the art 

TTie zinc finger-nucleotide binding polypeptide derivative is derived or produced &om a 
wild type zinc fingo^ protein by truncation or expansion, or as a variant of the wild type- 
derived polypeptide by a process of site directed mutagraiesis, or by a combination of (he 
procedures. 

The temi "tnmcaled" refsts to a anc finger-nucleotide binding polypeptide derivative that 
contains less than die fiill number of zinc fingers found in ihc native zinc finger binding 
protein or that has been deleted of non-desired sequences. For example, truncation of the 
zinc finger-nucleotide binding protein TFIIIA, vs4iich naturally contains nine anc fingos, 
might be a polypeptide with only zinc fingers one through three. Bqiansion refors to a 
zinc finger polypeptide to vAdch additional 2dnc finger modules have been added. For 
example, TFIIIA may be extoided to 12 fingers by adding 3 zinc finger domains. In 
addition, a truncated zinc fingerniucleotide binding polypeptide may include zinc finger 
modules fiiom more than one wild type polypeptide, thus resulting in a "hybrid" zinc 
finger-nucleotide binding polypeptide. 

The term "mutagenized" refers to a zinc fingo- derived-nucleotide binding polypeptide 
that has beai obtained by performing any of the known methods for accomplishing 
random or site-directed mutagenesis of the DNA encoding the protein. For instance, in 
TFIHA, mutagenesis can be performed to replace nonconserved residues in one or more 
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of the repeats of the consensus sequence. Truncated zinc finger-nucleotide binding 
proteins can also be mutagenized. 

Examples of known zinc finger-nucleotide binding proteins that can be truncated, 
expanded, and/or miitagenized according to the present invention in order to inhibit the 
5 fimction of a cellular sequence containing a zinc finger-nucleotide binding motif includes 
TFniA and zif268. Otho* zinc finger-nucleotide binding proteins will be known to those 
of skill in the ait 

The invention also provides a pharmaceutical composition comprising a therapeutically 
effective amount of a zinc finger-nucleotide binding polypeptide derivative or a 

1 0 &enq)eutically effective amount of a nucleotide sequence which encodes a zinc finger- 
nucleotide binding polypeptide derivative, \^erein tiie derivative binds to a cellular 
nucleotide sequence to modulate the fimction of the cellular nucleotide sequence, in 
combination with a pharmaceutically acceptable carrier. Pharmaceutical compositions 
containing one or more of the different zinc finger-nucleotide binding derivatives 

1 5 described herein are usefiil in the then^jeutic methods of the invention. 

As used herein, the terms "pharmaceutically acceptable", "physiologically tolerable" and 
grammatical variations thereof, as they refer to compositions, cairias, diluents and 
reagents, are used intochangeably and r^nesent tiiat the materials are capable of 
administration to or \spon a human widiout tiie production of undesirable physiological 
20 effects such as nausea, dizaness, gastric upset and the like ^^ch would be to a degree 
that would prohibit administration of the compositioiL 

The preparation of a pharmacological composition that contains active ingredients 
dissolved or dispersed therein is well understood in &e ait Typically such compositions 
aie prepared as sterile injectables either as liquid solutions or suspensions, aqueous or 
25 non-aqueous, however, solid forms suitable for solution, or suspensions, in liquid prior 
to use can also be prepared. The preparation can also be emulsified. 
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The active ingredient can be mixed with recipients which are phaimaceutically acceptable 
and compatible with the active ingredient and in amounts suitable for use in the 
thers^utic methods described herein. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol or the like and combinations thereof. In addition, if desired, 
5 the composition can contain minor amoimts of auxiliary substances such as wetting or 
emulsifying agents, as well as pH buffering agents and the like wiiich enhance the 
effectiveness of the active ingredient 

The then^utic pharmaceutical composition of the present invention can include 
phaimaceutically accq>table salts of&e components therein. Pharmaceutically acceptable 

10 salts include the acid addition salts (formed with the free amino groups of the 
polypqjtide) that are formed witii inorganic acids sudi as, for example, hydrochloric or 
phosphoric acids, or such organic acids as acetic, tartaric, mandelic and die like. Salts 
formed with die free caiboxyl groups can also be derived from inorganic bases such as, 
for example, sodium, potassium, ammonium, calciiun or ferric hydroxides, and such 

15 organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine and the like. 

Physiologically tolerable carriers are well known in die art Exemplary of liquid carriers 
are stoile aqueous solutions diat contain no materials in addition to the active ingredients 
and water, or contain a buffer such as sodium phosphate at physiological pH value, 
20 physiological saline or both, such as phosphate-buffered saline. Still further, aqueous 
carriers can contain more than one buffer salt, as well as salts such as sodium and 
potassium chlorides, dextrose, propylene glycol, polyethylene glycol and other solutes. 

Liquid compositions can also contain liquid phases in addition to and to the exclusion of 
water. Exemplary of such additional liquid phases are glycerin, vegetable oils such as 
25 cottonseed oil, organic esters such as ethyl oleate, and water-oil emulsions. 
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The invention includes a nucleotide sequence oicoding a zinc finger-nucleotide binding 
polypeptide variant. DNA sequences encoding the zinc finger-nucleotide binding 
polypeptides of the invention, including native, truncated, and expanded polypeptides, can 
be obtained by several methods. For example, the DNA can be isolated using 
5 hybridization procedures which are well known in the art. These include, but are not 
limited to: (1) hybridization of probes to genomic or cDNA libraries to detect shared 
nucleotide sequences; (2) antibody screening of expression libraries to detect shared 
structural features; and (3) synthesis by the polymerase chain reaction (PCR). RNA 
sequences of the invention can be obtained by methods known in the ait (See for example, 
10 Current Protocols in Molecular Biology, Ausubel, et al. eds., 1989). 

The developmrat of specific DNA sequences encoding zinc finger-nucleotide binding 
proteins of the invention can be obtained by: (1) isolation of a double-stranded DNA 
sequence fix>m the genomic DNA; (2) chemical manufacture of a DNA sequence to 
provide Ae necessaiy codons for the polypeptide of interest; and (3) in vitro synthesis of 

15 a double-stranded DNA sequence by reverse transcription of mRNA isolated firom a 
eukaiyotic donor cell. In the latter case, a double-stranded DNA complement of mRNA 
is evoitually formed wfaidi is genoally referred to as cDN A. Of these three methods for 
develqjing specific DNA sequoices for use in recombinant procedures, the isolation of 
gQiomic DNA is the least common. This is especially true when it is desirable to obtain 

20 the mioobial ejqnession of mammalian polypeptides due to tibe presence of introns. 

For obtaining zinc finger derived-DNA binding polypeptides, the synthesis of DNA 
sequmces is fi^quently tiie m^od of choice vihsu the entire sequence of anuno acid 
residues of the desired polypeptide product is known. When the entire sequence of amino 
acid residues of the desired polypeptide is not known, the direct synthesis of DNA 
25 sequences is not possible and the method of choice is the formation of cDNA sequences. 
Among the standard procedures for isolating cDN A sequences of interest is the formation 
of plasmid-cairying cDNA libraries wiiich are derived ftom reverse transcription of 
mRNA which is abundant in donor cells that have a high level of genetic e^qpression. 
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When used in combination with polymerase chain reaction technology, even rare 
expression products can be cloned. In those cases y/bete significant portions of the amino 
acid sequence of the polypeptide are known, the production of labeled single or double- 
stranded DNA or RNA probe sequences diQ>licating a sequence putatively present in the 
5 target cDNA may be employed in DNA/DNA hybridization procedures vliich are carried 
out on cloned copies of ibe cDNA which have been denatured into a single-stranded form 
(Jay, et al.. Nucleic Acid Research, 11:2325, 1 983). 

Hybridization procedures are useful for the screening of recombinant clones by using 
labeled mixed syndietic oligonucleotide probes y^ere each inobe is potentially the 

10 complete complement of a specific DNA sequence in the hybridization sample which 
includes a heterogeneous mixture of denatured double-stranded DNA. For such 
screening, hybridization is preferably performed on either single-stranded DNA or 
doiatured double-stranded DNA. Hybridization is particularly usefiil in the detection of 
cDNA clones derived fiom sources ^ere an extremely low amount of mRNA sequences 

15 relating to the polypeptide of interest are present By using stringent hybridization 
conditions directed to avoid non-specific binding, it is possible, for example, to allow the 
autoradiographic visualization of a spedfic cDNA clone by tiie hybridization of ibe target 
DNA to that single probe in tixe mixture vAach is its complete complement (Wallace, et 
al. Nucleic Acid Research, 2:879, 1981; Maniatis, et al.. Molecular Cloning: A 

20 Laboratory Manual, Cold Spring Harbor Laboratory, 1982). 

Sheening procedures v^iiich rely on nucleic acid hybridization make it possible to isolate 
any gene sequence fiom any organism, provided the s^ropriate probe is available. 
Oligonucleotide probes, which correspond to a part of the sequence encoding the protein 
in question, can be synthesized chemically. This requires that short, oligopeptide 
25 stretches of amino acid sequence must be known. The DNA sequence encoding the 
protein can be deduced fiom the genetic code, however, the degeneracy of the code must 
be taken into account It is possible to perform a mixed addition reaction \s^ien the 
sequence is degenerate. This includes a heterogeneous mixture of denatured double- 
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stranded DNA. For such screening, hybridization is preferably performed on either 
single-stranded DNA or denatured double-stranded DNA. 

Since the DNA sequences of the mvention encode essentially all or part of an zinc finger- 
nucleotide binding protein, it is now a routine matter to prepare, subclone, and express 
5 the truncated polypeptide fragments of DNA from this or corresponding DNA sequences. 
Alternatively, by utilizing the DNA fiagments disclosed herein which delSne the zinc 
finger-nucleotide binding polypeptides of the invention it is possible, in conjunction with 
known techniques, to determine the DNA sequences encoding the entire zinc finger- 
nucleotide binding protein. Such techniques are described in U.S. 4394,443 and U.S. 
1 0 4,446,235 wliich are incorporated herein by reference. 

A cDNA e^qnession library, such as lambda gtll, can be screoied indirectly for zinc 
finger-nucleotide binding protein or for the zinc finger derived polypqjtide having at least 
one epitope, using antibodies specific for ibs zinc finger-nucleotide binding protdrL Such 
antibodies can be either polyclonally or monoclonally derived and used to detect 
1 5 expression product indicative of the presence of anc finger-nucleotide binding protein 
cDNA. Altonatively, binding of the derived polypeptides to DNA targets can be assayed 
by incorporated radiolabeled DNA into the target site and testing for retardation of 
electrophoretic mobility as compared with unbound target site. 

A preferred vector used for idoitification of truncated and/or mutagenized zinc finger- 
20 nucleotide binding polypeptides is a recombinant DNA (rDNA) molecule containing a 
nucleotide sequence that codes for and is curable of expressing a fiision polypeptide 
containing, in the direction of amino- to carboxy-terminus, (1) a prokaryotic secretion 
signal domain, (2) a heterologous polypeptide, and (3) a filamentous phage membrane 
anchor domain. The vector includes DNA e}q)ression control sequences for expressing 
25 the fiision polypeptide, preferably prokaryotic control sequraces. 
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The filamentous phage membrane anchor is preferably a domain of the cpin or cpVUl 
coat protein capable of associating with the matrix of a filamentous phage particle, 
thereby incorporating the fusion polypeptide onto the phage sur&ce. 

The secretion signal is a leader peptide domain of a protein that targets the protein to the 
5 periplasmic membrane of gram negative bacteria. A preferred secretion signal is a pelB 
secretion signal. The predicted amino acid residue sequences of the secretion signal 
domain fit>m two pelB gene product variants &om Envinia carotova are desoibed in Lei, 
a/. (Atoiire, 321:543-546, 1988). 

The leader sequence of the pelB protein has previously been used as a seoetion signal for 
10 fusion proteins (Better, et al.. Science, 24Q: 1041-1 043, 1988; Sastry, et al.. Proa. Natl. 
Acad Sci. USA, M:5728-5732, 1989; and Mullinax, et al., Proc. Natl. Acad. Set USA, 
52:8095-8099, 1990). Amino acid residue sequences for other secretion signal 
polypeptide domains fit)m E. coli 

usefiil in this invration can be found in Oliver, In Neidhard, F.C. (ed.), Escherichia coli 
1 5 and Salmonella Typhimurium, American Society for Microbiology, Washington, D.C., 
1:56-69 (1987). 

Preferred monbrane andiors for i3a& vector are obtainable from filamentous phage Ml 3, 
fl , fd, and equivalent filamentous phage. Prefored membrane anchor domains are found 
in the coat proteins encoded by gene III and gene Vm. The membrane anchor domain 

20 of a filamentous phage coat protein is a portion of the carboxy tominal region of the coat 
protein and includes a region of hydrophobic amino acid residues for spanning a lipid 
bilayer monbrane, and a region of charged amino acid residues normally found at the 
cytoplasmic &ce of the membrane and extending away finm the membrane. In the phage 
fl, gene Vin coat protein's membrane spanning region comprises residue Trp-26 through 

25 Lys-40, and the cytoplasmic region comprises the carboxyterminal 1 1 residues firom 41 
to 52 (Ohkawa, et al., J. Biol Chem., 256:9951-9958, 1981): Thus, the amino acid 
residue sequence of a preferred membrane anchor domain is derived fix)m the Ml 3 
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filamentous phage gene VIII coat protein (also designated cpVIII or CP 8). Gene VIII 
coat protein is present on a mature filamentous phage over the majority of the phage 
particle with typically about 2500 to 3000 copies of the coat protein. 

In addition, the amino add residue sequence of another preferred membrane anchor 
5 domain is d^ved fiom the Ml 3 filamentous phage gene Hi coat protem (also designated 
cpQI). Gene m coat protein is present on a mature filamentous phage at one end of the 
phage particle with typically about 4 to 6 copies of the coat protein. For detailed 
descriptions of the structure of filamentous phage particles, their coat proteins and particle 
assembly, see the reviews by Rached, et al. (Microbiol. Rev., 5Q:401-427 1986; and 
1 0 Model, et al., in "The Bacteriophages: Vol. 2", R. Calendar, ed. Plenum Publishing Co., 
pp. 375-456, 1988). 

DNA e}q>ression control sequences comprise a set of DNA expression signals for 
egqjressing a structural geae product and include both 5' and 3' elements, as is well known, 
opeiatively linked to the cistron such that the cistron is able to esqness a structural gene 
15 product The 5' control sequences define a promoter for initiating transcription and a 
ribosome binding site operatively linked at the 5' teiminus of the upstream translatable 
DNA sequence. 

To adiieve high levels of gene eiqiression in R coli, it is necessary to use not only strong 
promotes to generate large quantities of mRNA, but also ribosome binding sites to ensure 

20 that the mRNA is efSdently translated. In R coli, the ribosome binding site includes an 
initiation codon (AUG) and a sequence 3-9 nucleotides long located 3-1 1 nucleotides 
iq)stream fiom the initiation codon (Shine, et al.. Nature, 254:34, 1975). The sequence, 
AGGAGGU, \^ch is called the Shine-Dalgamo (SD) sequence, is complementary to the 
3' end of £ coli 16S rRNA. Binding of the ribosome to mRNA and the sequence at the 

25 3' end of the mRNA can be affected by several fiictors: 
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(i) The degree of complementarity between the SD sequence and 3' end of 
theieSrRNA. 

(ii) The spacing and possibly the RNA sequence lying between the SD 
sequence and the AUG (Roberts, et al, Proc. Natl. Acad. ScL USA, 
26:760, 1979a; Roberts, et al., Proc. Natl. Acad Sci. USA, 26:5596, 
1979b; Guarcnte, et al.. Science, 202:1428, 1980; and Guarente, et al.. 
Cell, 2Q:543, 1980). Optimization is achieved by measuring the level of 
expression of genes in plasmids in v^ch this spacing is systematically 
altered. Comparison of different mRNAs shows that there are 
statistically prefmed sequences from positions -20 to +1 3 (where the A 
of the AUG is position 0) (Gold, et al.. Arum. Rev. Microbiol, 25:365, 
1981). Leader sequences have been shown to influence translation 
dramatically (Roberts, etal., 1979 a, b stqfra}. 

(iii) The nucleotide sequence following the AUG, vMch affects ribosome 
binding (Taniguchi, et al., J. Mol. Biol., 115:533, 1978). 

The 3' control sequences define at least one termination (stop) codon in firame with and 
operatively linked to the het«t>logous fiision p>. rlypeptide. 

In preferred embodimrats, die vector utilized includes a prokaryotic origin of replication 
or leplicon, Le., a DNA sequence having the ability to direct autonomous replication and 
maintenance of the recombinant DNA molecule extra-chromosomally in a prokaryotic 
host cell, such as a bacterial host cell, transformed therewith. Such origins of replication 
are weU known in the art Preferred origins of replication are those that are efficient in 
the host organism. A preferred host cell is E. coli. For use of a vector in E. coli, a 
preferred origin of replication is ColEl found in pBR322 and a variety of other common 
plasmids. Also preferred is the pl5A origin of replication found on pACYC and its 
derivatives. The ColEl and pi 5A replicon have been extensively utilized in molecular 
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biology, are available on a variety of plasmids and are described at least by Sambrook, 
et ah. Molecular Cloning: a Laboratory Manual, 2nd edition. Cold Spring Harbor 
Laboratoiy Press, 1989). 

The ColEl and pl5A replicons are particularly preferred for use in the present invention 
5 because they each have the ability to direct the rq)lication of plasmid in E. coli while the 
other replicon is present in a second plasmid in Ae same E coli cell. In other words, 
CoIEl and pl5A arc non-interfaing replicons that allow the maintenance of two plasmids 
in the same host (see, for example, Sambrook, et al, stq)ra, at pages 1.3-1 .4). 

In addition, those embodiments that include a prokaryotic replicon also include a gene 
1 0 vk^ose ejq^xression confers a selective advantage, such as drug resistance, to a bacterial 
host transformed thaewith. Typical bacterial drug resistance genes are those that confer 
resistance to ampicillin, tetracycline, neomycin/kanamycin or cholamphenicol. Vectors 
typically also contain convenient restriction sites for insertion of translatable DNA 
sequences. Exemplary vectors are the plasmids pUC8, pUC9, pBR322, and pBR329 
1 5 available from BioRad Laboratories, (Richmond, CA) and pPL and pKK223 available 
fiom Pharmacia, (Piscataway, NJ) and pBS (Stratagene, La Jolla, CA). 

The vector comprises a first cass^ that includes iqistream and downstream translatable 
DNA sequences opaalively linked via a sequence of nxicleotides adapted for directional 
ligation to an insert DNA. The iq)stream translatable sequence encodes the secretion 

20 signal as defined herein. The downstream translatable sequence raicodes the filamentous 
phage membrane andior as defined herein. The cassette preferably includes DNA 
expression control sequences for expressing tiie anc finger-derived polypeptide that is 
produced v^iien an insert translatable DNA sequence (insert DNA) is directionally inserted 
into the cassette via the sequence of nucleotides adapted for directional ligation. The 

25 filamentous phage membrane anchor is preferably a domain of the cpIII or cpVIII coat 
protein capable of binding the matrix of a filamentous phage particle, thereby 
incorporating the fusion polypeptide onto the plu^e surface. 
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The zinc finger derived polypeptide expression vector also contains a second cassette for 
expressing a second receptor poljrpeptide. The second cassette includes a second 
translatable DNA sequence that encodes a secretion signal, as defined herein, operatively 
linked at its 3' terminus via a sequence of nucleotides adapted for directional ligation to 
5 a downstream DNA sequence of the vector that typically defines at least one stop codon 
in the reading fiame of the cassette. The second translatable DNA sequence is operatively 
linked at its S' terminus to DNA expression control sequences forming the S' elements. 
The second cassette is capable, iq)on insertion of a translatable DNA sequence (insert 
DNA), of oqiressing the second fusion polypqjtide comprising a receptor of the secretion 
10 signal with a polypeptide coded by die insert DNA. For purposes ofthis invention, the 
second cassette sequences have been deleted. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
between different graietic oivironments another nucleic acid to ^^ch it has been 
operatively linked. Preferred vectors are those capable of autonomous replication and 
1 5 expression of structural gene products present in the DNA segments to which they are 
operatively linked. Vectors, therefore, preferably contain the replicons and selectable 
markers described earlier. 

As used herein with regard to DNA sequences or segments, the phrase "opaatively 
linked" means the sequences or segments have been covalentfy joined, preferably by 

20 convoitional phosphodiester bonds, into one strand of DNA, i^ether in single or double 
stranded form. The choice of vector to ^ch transcription unit or a cassette of this 
invention is operatively linked depends directiy, as is well known in the art, on the 
functional properties desired, e.g., vector replication and protein expression, and the host 
cell to be transformed, these being limitations inherent in the art of constructing 

25 recombinant DNA molecules. 

A sequence of nucleotides adapted for directional ligation, ie., a polylinker, is a region 
of the DNA expression vector that (1) operatively links for replication and transport the 
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upstream and downstream translatable DNA sequences and (2) provides a site or means 
for directional ligation of a DNA sequence into the vector. Typically, a directional 
polylinker is a sequence of nucleotides that defines two or more restriction endonuclease 
recognition sequences, or restriction sites. Upon restriction cleavage, the two sites yield 
5 cohesive termini to which a translatable DNA sequence can be ligated to the DNA 
e^qiression vector. Preferably, the two restriction sites provide, upon restriction cleavage, 
cohesive termini that are non-complementaiy and tiiereby permit directional insertion of 
a translatable DNA sequence into the cassette. In. one embodiment, the directional 
ligation means is provided by nucleotides present in the upstream translatable DNA 
10 sequence, downstream translaitable DNA sequence, or bo&. In another embodiment, the 
sequence of nucleotides adapted for directional ligation comprises a sequence of 
nucleotides that defines multiple directional cloning means. Where ^ sequence of 
nucleotides adapted for directional ligation defines num^us restriction sites, it is 
referred to as a multiple cloning site. 

15 In a preferred embodiment, a DNA expression vector is designed for convenient 
manipulation in the form of a filamentous phage particle encapsulating DNA encoding 
the zinc finger-nucleotide binding polypeptides of the present invention. In this 
embodiment, a DNA esqnession vector fiirdier contains a nucleotide sequence tiiat defines 
a filamentous phage origin of rqilication sudi that the vector, upon presentation of the 

20 appropriate genetic complemoitation, can replicate as a filamentous phage in single 
stranded replicative form and be packaged into filamoitous phage particles. This feature 
provides the ability of the DNA expression vector to be padcaged into phage particles for 
subsequent segregation of the particle, and vector contained tiierein, away fix>m other 
particles that comprise a population of phage particles using screening technique well 

25 known in the art. 

A filamentous phage origin of replication is a region of the phage genome, as is well 
known, that defines sites for initiation of replication, termination of replication and 



wo 95/19431 



PCT/US95rt)0829 



-25- 

packaging of the rq^licadve form produced by replication (see, for example, Rasched, et 
al., Microbiol. Rev., 5Q:401-427, 1986; and Horiuchi, J. Mol. Biol., lfiS:21 5-223, 1986). 

A preferred filamentoiis phage origin of replication for use in the present invention is an 
M13, fl or fd phage origin of replication (Short, et al. (Nucl. Acids Res., 16:7583-7600, 
5 1988). Preferred DNA expression vectors are tiie expression vectors modified pC0MB3 
and specifically pCOMB3.5. 

The production of a DNA sequoure encoding a zinc finger-nucleotide binding polypeptide 
can be accomplished by oligonucleotide(s) which are primers for amplification of the 
genomic polynucleotide oicoding an zinc finger-nucleotide binding polypeptide. These 

1 0 unique oligonucleotide ptimors can be produced based tqx>n identification of the fl a nk i ng 
regions contiguous with the polynucleotide encoding the zinc finger-nucleotide binding 
polypeptide. These oligonucleotide primers comprise sequences which are capable of 
hybridizing with the flanking nucleotide sequence encoding a zinc finger-nucleotide 
binding polypeptide and seqtiences complementary thereto and can be used to introduce 

1 5 point mutations into the amplification products. 

The inimos of the invoition include oligonucleotides of sufGcient length and q^propiiate 
sequence so as to provide specific initiation of polymerization on a significant number of 
nucleic acids in the polynucleotide oicoding the zinc finger-nucleotide bmding 
polypeptide. Specifically, ins tarn "primer" as used herein xcfas to a sequence compris- 

20 ing two or more deoxyribonucleotides or ribonucleotides, preferably more than three, 
vAdda. sequence is capable of initiating ^nthesis of a primer extension product, which is 
substantially complemraitary to a anc finger-nucleotide binding protein strand, but can 
also introduce mutations into the amplification products at selected residue sites. 
E:q}erimental conditions conducive to syntiiesis include the presence of nucleoside 

25 triphosphates and an agent for polymerization and extension, siich as DNA polymerase, 
and a suitable buflFer, temperature and pH. The primer is preferably single stranded for 
n wivimuin efficiency in amplification, but may be double stranded. If double stranded. 
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the primer is first treated to separate the two strands before being used to prepare 
extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer 
must be sufficiently long to prime the synthesis of extension products in the presence of 
the inducing agent for pol}rmerization and extension of the nucleotides. The exact length 
5 of primer will depend on many fiictois, including temperature, buffer, and nucleotide 
composition. The oligonucleotide primer typically contains 1 5-22 or more nucleotides, 
although it may contain fewer nucleotides. Alternatively, as is well known in the art, the 
mixture of nucleoside triphosphates can be biased to influence the formation of mutations 
to obtain a library of cDNAs encoding putative zinc finger-nucleotide binding 
10 polypeptides that can be screened in a fimctional assay for binding to a zinc finger- 
nucleotide binding motif, such as one in a promoter in which the binding inhibits 
transcriptional activation. 

Primers of die invention are designed to be "substantially" complementary to a segment 
of each strand of polynucleotide encoding the zinc finger-nucleotide binding protein to 

15 be amplified. This means that the primers must be sufQciently complementary to 
hybridize with tfaeir respective strands under conditions ^^ch allow the agent for 
polymoization and nucleotide ext»ision to act In other words, the primers should have 
sufBcirat conq)lementarity with the flanking sequences to hybridize thoewith and permit 
amplification of die polynucleotide encoding the zinc finger-nucleotide binding proteiiL 

20 Preferably, the primers have exact complementarity with the flanking sequence strand. 

Oligonucleotide primers of ibe invoition are employed in the amplification process ^^ch 
is an enqrmatic chain reaction that produces e}qx)nential quantities of polynucleotide 
oicoding the zinc finger-nucleotide binding polypqitide relative to the number of reaction 
steps involved. Typically, one primer is complementary to the negative (-) strand of the 
25 polynucleotide encoding the anc finger-nuCleotide binding protein and die other is 
complementary to the positive (+) strand. Aimealing the primers to denatured nucleic 
acid followed by extension with an enzyme, such as the large fiagment of DNA 
Polymerase I (Klenow) and nucleotides, results in newly synthesized (+) and (-) strands 
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containing the zinc finger-nucleotide binding protein sequence. Because these newly 
synthesized sequences are also templates, repeated cycles of denaturing, primer annealing, 
and extension results in ejqwnential production of the sequence (i.e., the zinc finger- 
nucleotide binding protein polynucleotide sequence) defined by the primer. The product 
5 of the chain reaction is a discrete nucleic acid duplex with termini corresponding to the 
ends of the specific primers employed. Those of skill in the art will know of other 
amplification methodologies vMch can also be utilized to increase the copy number of 
target nucleic acid. These may include for example, ligation activated transcription 
(LAT), ligase cham reaction (LCR), and strand displacemrait activation (SDA), although 
1 0 PCR is the preferred method. 

The oUgonucleotide primers of ^ invention may be prepared using any suitable method, 
such as conventional pho^hotriestra- and phosphodiester methods or automated 
embodiments ftereof In one sudi automated embodiment, diethylphosphoramidites are 
used as starting materials and may be synthesized as described by Beaucage, et al. 
15 {Tetrahedron Letters, 22:1859-1862, 1981). One method for synthesizing 
oligonucleotides on a modified solid siq>port is described in U.S. Patent No. 4,458,066. 
One method of amplification v*ich can be used according to this invention is the 
polymaase chain reaction (PCR) desaibed in U.S. Patent Nos. 4,683,202 and 4,683,195. 

Methods for utilizing filamentous phage libraries to obtain mutations of peptide 
20 sequences are disclosed m U. S. Patent 5,223,409 to Ladner et al., v»*ich is incorporated 
by reference herein in its entirety. 

In one embodiment of the invention, randomized nucleotide substitutions can be 
perfonned on the DN A encoding one or more fingers of a known zinc finger protein to 
obtain a derived polypeptide that modifies gene e^qiression upon binding to a site on the 
25 DNA containing the gene, such as a transcriptional control element. In addition to 
modifications in the amino acids making vcp the zinc finger, the zinc finger derived 
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Polypeptide can contain more or less than the full amonnt of « 
typeproteinfiom^chitisderived. ~ - the 
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Kuhn Hoffinann-Berling (CSH-Quantitative Biology, 42:63, 1978) and techniques for 
using RecA are reviewed in C. Radding (Ann. Rev. Genetics, M:405-437, 1982). 

If the nucleic acid containing the sequence to be amplified is single stranded, its 
complement is synthesized by adding one or two oligonucleotide primers. If a single 
5 primer is utilized, a primer ejctension product is synthesized in the presence of primer, an 
agent for polymerization, and the four nucleoside triphosphates described below. The 
product will be partially complementary to the single-stranded nucleic acid and will 
hybrictize with a single-stranded nucleic acid to form a duplex of unequal length strands 
that may then be separated into single strands to produce two single separated 
10 complanentary strands. Alternatively, two primers may be added to the single-stranded 
nucleic acid and the reaction carried out as desoibed. 

V/hea complementary strands of nucleic acid or acids are separated, regardless of whether 
the nucleic acid was originally double or single stranded, the separated strands are ready 
to be used as a template for the synthesis of additional nucleic acid strands. This 

1 5 syndesis is performed under conditions allowing hybridization of primers to templates 
to occur. Generally synthesis occurs in a buffered aqueous solution, prefonbly at a pH 
of 7-9, most prefiaably about 8. Preferably, a molar excess (for genomic nucleic acid, 
usually about 10': 1 primertenqjlate) of fte two oligonucleotide primers is added to the 
buffer containing flie separated template strands. It is understood, however, ftat the 

20 amount of complementary strand may not be known if tiie process of the invention is used 
for diagnostic ^plications, so that the amount of primer relative to tiie amount of 
complementary strand cannot be determined with certainty. As a practical matter, 
however, the amount of primer added will generally be in molar excess over ttie amount 
of complementary strand (template) when the sequence to be amplified is contained in a 

25 mixture ofcomplicated long-chain nucleic acid strands. A large molar excess is preferred 
to improve the efficiency of the process. 
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The deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP are added to the 
synthesis mixture, either separately or together with the primers, in adequate amounts and 
the resulting solution is heated to about W^-IOO'C fiom about 1 to 10 minutes, preferably 
fh)m 1 to 4 minutes. After this heating period, the solution is allowed to cool to a 
5 temperature that is preferable for the primer hybridizatioiL To the cooled mixture is 
added an ^propriate agent for effecting the primer extension reaction (called herein 
"agent for polymerization"), and the reaction is allowed to occur under conditions known 
in the art The agent for polymerization may also be added together with the other 
reagents if it is heat stable. This synthesis (or amplification) reaction may occur at room 
10 temperature iq) to a temperature above which the agent for polymerization no longer 
functions. Most convoiiently tiie reaction occurs at room temperature. 

The agent for polymoization may be any compound or system v^ch will function to 
accomplish the synthesis of primer extension products, including enzymes. Suitable 
enzymes for tins purpose include, for example, E. coli DNA polymerase I, Klenow 

15 fragment of K coli DNA polymerase I, T4 DNA polymerase, other available DNA 
polymerases, polymerase muteins, reverse transcriptase, and otho- enzymes, including 
heat-stable en^mes (i,e., those enzymes ^^ch perform primer extension after being 
subjected to temperatures sufBciaitly elevated to cause denaturation). Suitable emgmes 
will &cilitate combination of the nucleotides in the proper manner to form the primer 

20 extension products vMcb are complemraitaiy to each zinc finger-nucleotide binding 
protein nucleic acid strand. Goierally, the syn&esis will be initiated at die 3' end of each 
primer and proceed in the 5' direction along the template strand, until synthesis 
terminates, producing molecules of different Iraigths. There may be agents for 
polymerization, however, vMch initiate synthesis at the 5' end and proceed in the other 

25 direction, using the same process as described above. 

The newly synthesized zinc finger-nucleotide binding polypeptide strand and its 
complementary nucleic acid strand will form a double-stranded molecule under 
hybridiang conditions described above and this hybrid is used in subsequent steps of the 
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process. In the next step, the newly synthesized double-stranded molecule is subjected 
to denaturing conditions using any of the procedures described above to provide single- 
stranded molecules. 

The above process is repeated on the single-stranded molecules. Additional agent for 
5 polymerizatibn, nucleotides, and primers may be added, if necessary, for the reaction to 
proceed under the conditions prescribed above. Again, the synthesis will be initiated at 
one end of each of the oligonucleotide primers and will proceed along the single strands 
of the template to produce additional nucleic acid. After this step, half of the extension 
product will consist of the specific nucleic add sequence bounded by the two primers. 

10 The steps of draiaturing and extension product synthesis can be repeated as often as 
needed to amplify the zinc finger-nucleotide bmding protein nucleic acid sequence to the 
extent necessary for detection. The amount of the specific nucleic acid sequence 
produced will accumulate in an exponential fashion. 

Sequences amplified by the methods of the invration can be furthra: evaluated, detected, 
1 5 cloned, sequenced, and the like, either in solution or after binding to a solid support, by 
any method usually i^lied to the detection of a specific DNA sequence such as PCR, 
oligomer restriction (Saiki, et al.. Bio/Technology, 2:1008-1012, 1985), allele-specific 
oligonucleotide (ASO) probe analysis (Conner, et al., Proc. Natl Acad. ScL USA, SQ:278, 
1983), oligonucleotide ligation assays (OLAs) (Landegren, et al.. Science, 211:1077, 
20 1988), and the like. Molecular techniques for DNA analysis have been reviewed 
(LandegTKi, et al.. Science, 242:229-237, 1988). Preferably, novel zinc finger derived- 
DNA binding polypeptides of the invention can be isolated utilizing the above techniques 
wherein the primers allow modification, such as substitution, of nucleotides such that 
unique zinc fingers are produced (See Examples for fiirther detail). 

25 In the present invention, the zinc finger-nucleotide binding polypeptide encoding 
nucleotide sequences may be inserted into a recombinant expression vector. The term 
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"recombinant expression vector" refers to a plasmid, virus or other vehicle known in the 
art that has been manipulated by insertion or incorporation of zinc finger derived- 
nucleotide binding protein genetic sequences. Such expression vectors contain a 
promotor sequence which facilitates the efficient transcription of the inserted genetic 
5 sequence in the host The expression vector typically contains an origin of replication, 
a promoter, as well as ^)ecific genes which allow phenotypic selection of the transformed 
cells. Vectors suitable for use in the present invention include, but are not limited to the 
T7-based expression vector for e}q)ression in bacteria (Rosenberg, et al. Gene 56:125, 
1987), the pMSXND expression vector for e}q)ression in mammalian cells (Lee and 
1 0 Nathans, J. Biol Chem. 261:3521, 1988) and baculovirus-derived vectors for expression 
in insect cells. The DNA segment can be present in tiie vector operably linked to 
regulatory elements, for example, a promoter (e.g. , T7, metallothionein I, or polyhedrin 
promoters). 

DNA sequences encoding novel zinc finger-nucleotide binding polypeptides of the 
1 5 invention can be esqxressed in vitro by DNA transfer into a suitable host cell. "Host cells" 
are cells in which a vector can be propagated and its DNA expresse± The term also 
includes any progeny of the subject host cell. It is understood that all progeny may not 
be identical to the parental cell since thae may be mutations fliat occur during replication. 
However, such progeny are included when the term "host cell" is used. Methods of stable 
20 transfer, in oflier words vAaea flie foreign DNA is continuously m ai nt ai ne d in the host, are 
known in the art 

Transformation of a host cell with recombinant DNA may be carried out by conventional 
techniques as are well known to those ^lled in the art Where the host is prokaryotic, 
such as E. coli, competent cells which are capable of DNA uptake can be prepared from 
25 cells harvested after exponential growth phase and subsequently treated by the CaClz 
me&od by procedures well known in the art Altanatively, MgClj or RbCl can be used. 
Transformation can also be performed after forming a protoplast of the host cell or by 
electroporation. 
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When the host is a eukaiyote, such methods of tiansfection of DNA as calcium phosphate 
co-precipitates, conventional mechanical procedures such as microinjection, 
electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be 
used. 

5 A variety of host-expression vector systems may be utilized to express the zinc finger 
derived-niicleotide binding coding sequence. These incliide but are not limited to 
microorganisms such as bacteria transformed with recombinant bacteriophage DNA, 
plasmid DNA or cosmid DNA expression vectors containing a zinc finger derived- 
nucleotide binding polypeptide coding sequence; yeast transformed with recombinant 

1 0 yeast ejqnession vectors containing the 2dnc finger-nucleotide binding coding sequence; 
plant cell systems infected witii recombinant virus expression vectors (e.g., cauliflower 
mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors Ti plasmid) containing a zinc finger derived-DNA 
binding coding sequm%; insect cell systems infected with recombinant virus expression 

1 5 vectors {e.g., baculovirus) containing a zinc finger-nucleotide binding coding sequence; 
or animal cell systems infected with recombmant virus eiqiression vectors (e.g., 
retroviruses, adenovirus, vaccinia virtis) containing a zinc finger derived-nucleotide 
binding coding sequence, or transformed animal cell systems engineered for stable 
ejqnession. In such cases vdiere glycosylation may be important, ejqiression systrans tiiat 

20 provide for translational and post-translational modifications may be used; e.g., 
mammalian, insect, yeast or plant expression systems. 

Depending on the hostArector system utilized, any of a number of suitable transcription 
and translation elements, including constitutive and inducible promoters, transcription 
enhancer elements, transcription terminators, etc. may be used in Hbs e^qiression vector 
25 (see e.g.. Bitter, et al. Methods in Enzymology, 152:516-544, 1987). For example, \^en 
cloning in bacterial systems, inducible promoters such as pL of bacteriophage A, plac, 
ptrp, ptac (ptrp-lac hybrid promoter) and the like may be used. When cloning in 
mammalian cell systems, promoters derived fiom the genome of m a mm al i an cells {e.g.. 
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metallothionein promoter) or fiom mammalian viruses (e.g., the retrovirus long terminal 
repeat; the adenovirus late promoter, the vaccinia virus 7.SK promoter) may be used. 
Promoters produced by recombinant DNA or synthetic techniques may also be used to 
provide for transcription of the inserted zinc finger-nucleotide binding polypeptide coding 
5 sequ^ice. 

In bacterial systems a number of expression vectors may be advantageously selected 
depending upon the use intraided for the zinc finger derived nucleotide-binding 
polypeptide esqiressed. For example, vAien large quantities are to be produced, vectors 
which direct the e}q)ression of high levels of fusion protein products that are readily 

10 purified may be desirable. Those wUch are engineered to contain a cleavage site to aid 
in recovering the protdn are prefisrred. Such vectors include but are not limited to the E. 
coli raqiression vector pUR278 (Ruflier, e/o/., EAdBOJ., 2:1791, 1983), in v»*ichflie anc 
fingor-nucleotide binding protein coding sequence may be ligated into the vector in fiame 
with the lac Z coding region so that a hyMd zinc finger-lac Z protein is produced; pIN 

15 vectors (Inouye & Inouye, Nucleic Acids Res. 12:3101-3109, 1985; Van Heeke & 
Schuster, J. Biol. Chem. 264:5503-5509, 1989); and the like. 

In yeast, a number of vectcos containing constitutive or inducible promoters may be used. 
For a review see. Current Protocols in Molecttlar Biology, Vol. 2, 1988, Ed. Ausubel, 
et al., Greene Publish. Assoc. & ^iley Interscience, Ch. 13; Grant, et ai., 1987, 

20 Expression and Secretion Vectors far Yeast, in Methods in ^i^maology, Eds. Wu & 
Grossman, 31987, Acad. Prras, N.Y., Vol. 153, pp.516-544; GIovct, 1986, DNA Cloning, 
Vol. n, IRL Press, Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous Gene Eiqjression 
in Yeast, Methods in Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, 
pp. 673-684; and The Molecular Biology of the Yeast Saccharomyces, 1982, Eds. 

25 Strathem et al.. Cold Spring Harbor Press, Vols. I and U. A constitutive yeast promoter 
such as ADH or LEU2 or an inducible promoter such as GAL may be used (Cloning in 
Yeast, Ch. 3, R. Rothstein In: DNA Cloning Vol.11, A Practical Approach, Ed. DM 
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Glover, 1986, IRL Press, Wash., D.C.). Alternatively, vectors may be used which 
promote integration of foreign DNA sequences into the yeast chromosome. 

In cases v/herc plant expression vectors are used, the expression of a zinc finger- 
nucleotide binding polypeptide coding sequence may be driven by any of a number of 
5 promoters. For example, viral promoters such as the 3SS RNA and 1 9S RNA promoters 
of CaMV (Brisson, et al.. Nature, 310:51 1-514, 1984), or the coat protein promoter to 
TMV (Takamatsu, et al, EMBO J., fi:307-3 11,1 987) may be used; alternatively, plant 
promoters sudi as the small subunit of RUBISCO (Coruzzi, etal.,EMBOJ. 1:1671-1680, 
1984; Bioglie, et al.. Science 224:838-843. 1984); or heat shock promoters, e.g., soybean 

10 hspl7.5-E or hspl73-B (Gurley, et al., Mol. Cell. Biol, fi:559-565, 1986) may be used. 
These constructs can be introduced into plant cells using H plasmids, Ri plasmids, plant 
virus vectors, direct DNA transformation, microinjection, electroporation, etc. For 
reviews of such techniques see, for exanq)le, Weissbach & Weissbach, Methods for Plant 
Molecular Biology, Academic Press, NY, Section Vm, pp. 421-463, 1988; and Giierson 

15 & Corey, Plant Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9, 1988. 

An altonative expression system that can be used to expcess a protein of the invention is 
an insect system. In one such system, Autographa califomica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to aqaess foreign genes. The virus grows in Spodoptera 
frugiperda cells. The zinc finger-nucleotide binding polypeptide coding sequence may 

20 be cloned into non-essential regions (Spodoptera frugiperda for example the polyhedrin 
gene) of the virus and placed under control of an AcNPV promoter (for example the 
polyhedrin promoter). Successful insertion of the zinc finger-nucleotide binding 
polypeptide coding sequence will result in inactivation of the polyhedrin gene and 
production of non-occluded recombinant virus {i.e., virus lacking the proteinaceous coat 

25 coded for by the polyhedrin gene). These recombinant viruses are then used to infect cells 
in vAuch the inserted gene is expressed. (Kg., see Smith, et al., J. Biol 4i^:584, 1983; 
Smitii, U.S. Patent No. 4,215,051). 
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Eukaryotic systems, and preferably mammalian expression systems, allow for proper 
post-translational modifications of expressed mammalian proteins to occur. Therefore, 
eukaiyotic cells, such as mammalian cells that possess the cellular machinery for proper 
processing of the primary transcript, glycosylation, phosphorylation, and, advantageously 
5 secretion of the gene product, are tbs preferred host cells for the expression of a zinc 
finger derived-nucleotide binding polypeptide. Such host cell lines may include but are 
not limited to CHO, VERO, BHK, HeLa, COS, MDCK, -293, and WI38. 

Mammalian cell systems that utilize recombinant viruses or viral elements to direct 
e}q)ression may be engineered. For example, w^en using adenovirus egression vectors, 

10 tiie coding sequence of a zinc finger doived polypeptide may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may tfien be inserted into the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome 
(e.g., region El or E3) will result in a recombinant virus that is viable and capable of 

1 5 e^qnessing the zinc finger polypeptide in infected hosts (e.g., see Logan & Shenk, Proc. 
Natl. Acad. Sci USA 51:3655-3659, 1984). Alternatively, tiie vaccinia virus 7.5K 
promoter may be used. (e.g., see, Madcett, etal., Proc. Natl. Acad Sci USA, 22:7415- 
7419, 1982; Mackett, el al.. J. Virol 42:857-864, 1984; Panicali, etal., Proc. Natl. Acad 
, Sci. USA , 22:4927-4931, 1982). Of particular interest are vectors based on bovine 

20 pi^illoma virus which have the ability to replicate as extrachromosomal elements (Sarver, 
et al.,Mol. Cell. Biol. 1:486, 1981). Shortly after entry of this DNA into mouse cells, the 
plasmid replicates to about 100 to 200 copies per cell. Transcription of the inserted 
cDNA does not require integration of the plasmid into the host* s chromosome, thereby 
yielding a high level of expression. These vectors can be used for stable expression by 

25 including a selectable marker in the plasmid, such as the neo gene. Alternatively, the 
retroviral genome can be modified for iise as a vector capable of introducing and directing 
the expression of the zinc finger-nucleotide binding protein gene in host cells (Cone & 
Mulligan, Proc. Natl. Acad Sci. USA &l:6349-6353, 1984). ffigh level e;q)ression may 
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also be achieved using inducible promoters, including, but not limited to, the 
metallothionine IIA promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. Rather than using e)q}ression vectors v/bich contain viral origins of replication, 
5 host cells can be transformed with the a cDNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer, sequences, transcription terminators, 
polyadenylation sites, etc.), and a selectable marker. The selectable marker in the 
recombinant plasmid confers resistance to the selection and allows cells to stably integrate 
the plasmid into their chromosomes and grow to form foci ^^ch in turn can be cloned 

1 0 and expanded into cell lines. For example, following the introduction of foreign DNA, 
engineered cells may be allowed to grow for 1-2 days in an enriched media, and tben are 
switdied to a selective media. A nimiber of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase O^^gler, et al.. Cell 11:223, 
1977), hypoxantiiine-guanine phosphoribosyltransferase (Szybalska & S^balski, Proc. 

1 5 Natl. Acad Sci. USA, 4S:2026, 1 962), and adenine phosphoribosyltransferase (Lowy, et 
al.. Cell, 22:817, 1980) genes, wWch can be employed in tk", hgprr or aprT cells 
respectively. Also, antimetabolite resistance-conferring genes can be used as the basis 
of selection; for example, tbe goies for dhfr, vAdch conf^ resistance to methotrexate 
(Wigler, etal., Natl. Acad Sci USA .77:3567, 1980; OBare, etaI.,Proc. NatL Acad Sci 

20 tZS«4 . 2fi:l 527, 1 98 1 ); gpt, v^^iich confers resistance to mycophenolic acid (Mulligan & 
Berg, Proc. Natl. Acad Sci USA, 28:2072, 1981; neo, vMch confers resistance to the 
aminoglycoside G-418 (Colbeire-Garapm, et al.,J. Mol. Ao/., 15fl:l, 1981); and hygro, 
which confers resistance to hygromycin (Santoie, et al.. Gene, 3Q:147, 1984). Recraitly, 
additional selectable genes have been described, namely tipB, wdiich allows cells to 

25 utili2e indole in place of tryptophan; hisD, wliich allows cells to utilize histinol in place 
of histidine (Hartman & Mulligan, Proc. Natl. Acad Sci USA, 25:804, 1988); and ODC 
(ornithine decarboxylase) vvWch confers resistance to the ornithine decarboxylase 
inhibitor, 2-(difluoromethyl)-DL-omithine, DFMO (McConlogue L., In: Current 
Communications in Molecular Biology. Cold Spring Harbor Laboratory ed., 1987). 
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Isolation and purification of microbialiy expressed protein, or firagments thereof provided 
by the invention, may be carried out by conventional means including preparative 
chromatography and immunological separations involving monoclonal or polyclonal 
antibodies. Antibodies provided in the present invention are immunoreactive with the 
5 zinc finger-nucleotide binding protein of the invention. Antibody which consists 
essentially of pooled monoclonal antibodies with different epitopic specificities, as well 
as distinct monoclonal antibody preparations are provided. Monoclonal antibodies are 
made fiom antigen containing fiagments of the protein by methods well known in the art 
(Kohler, et al., NcOure, 256:495. 1 975; Current Protocols in Molecular Biology, Ausubel, 
10 e/fl/.,ed.,1989). 

The present invention also provides gene ther^y for the treatment of cell proliferative 
disorders vMch are associated with a cellular nucleotide sequence containing a zinc 
finger-nucleotide binding motif. Such ihsapy would achieve its therapeutic effect by 
introduction of the zinc finger-nucleotide binding polypeptide polynucleotide, into cells 
15 of animals having tiie proliferative disorder. Delivery of a polynucleotide encoding a zinc 
finger-nucleotide binding protein can be adiieved using a recombinant expression vector 
such as a chimeric virus or a colloidal dispo^on system, for example. 

The term "cell-proliferadve disorder" denotes malignant as well as non-malignant cell 
populations v^ch morphologically often appear to differ fiom the surrounding tissue. 

20 The cell-proliferative disorder may be a transcriptional disorder which results in an 
increase or a decrease in geae eiqpression level. The cause of the disorder may be of 
cellular origin or viral origin. Gene therapy using a zinc finger-nucleotide binding 
polypeptide can be used to treat a viriis-iiuiuced cell proliferative disorder in a humaii, for 
example, as well as in a plant Treatment can be prophylactic in order to make a plant 

25 cell, for example, resistant to a virus, or therapeutic, in order to ameliorate an established 
infection in a cell, by preventing production of viral products. 
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A polynucleotide encoding the zinc finger-nucleotide binding polypeptide is iiseful in 
treating malignancies of the various organ systems, such as, for example, lung, breast, 
lymphoid, gastrointestinal, and genito-urinaiy tract as well as adenocarcinomas which 
include malignancies such as most colon cancers, renal-cell carcinoma, prostate cancer, 
5 non-small cell carcinoma of the lung, cancer of the small intestine, and cancer of the 
esophagus. A polynucleotide encoding the zinc finger-nucleotide binding polypeptide is 
also usefiil in treating non-malignant cell-proliferative diseases such as psoriasis, 
pemphigus vulgaris, Behcet's syndrome, and lipid histiocytosis. Essentially, any disorder 
which is etiologically linked to the activation of a zinc finger-nucleotide binding motif 
10 containing promoter, structural gene, or RNA, would be considered susceptible to 
treatment with a polynucleotide encoding a derivative or variant zinc finger derived- 
nucleotide binding polypqitide. 

Various viral vectors that can be utilized for gene therapy as taught herein include 
adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. 

15 Prefaably,1heretroviral vector is a derivative of a murine or avian retrovirus. Examples 
of retroviral vectors in \vbich a single foreign gene can be inserted include, but are not 
limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus 
(HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). 
A number of additional retroviral vectors can incorporate multiple genes. All of these 

20 vectors can transfer or incorporate a gene for a selectable maricer so that transduced cells 
can be identified and generated. By inserting a zinc finger derived-DNA binding 
polypeptide sequraice of interest into the viral vector, along with another gene that 
encodes the ligand for a receptor on a specific target cell, for exan^le, Ae vector is made 
target specific. Retroviral vectors can be made target specific by inserting, for example, 

25 a polynucleotide encoding a protein. Preferred targeting is accomplished by using an 
antibody to target the retroviral vector. Those of skill in the art will know of, or can 
readily ascertain without undue experimentation, specific polynucleotide sequences which 
can be inserted into the retroviral genome to allow target specific delivejy of the retroviral 
vector containing the zinc finger-nucleotide binding protein polynucleotide. 
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Since recombinant retroviruses are defective, they require assistance in order to produce 
infectious vector particles. This assistance can be provided, for racample, by using helper 
cell lines that contain plasmids encoding all of the structural genes of the retrovirus under 
the control of regulatory sequences within the LTR. These plasmids are missing a 
5 nucleotide sequence wUch enables the packaging mechanism to recognize an RNA 
transcript for encapsitation. Helper cell lines vMch have deletions of the packaging 
signal.includebutarenotlimitedto T2,PA317andPA12,forexample. TheseceUlines 
produce empty virions, since no g«iome is packaged. If a retroviral vector is introduced 
into such cells in \(^ch the packaging signal is intact, but the structural genes are 
10 ' replaced by other genes of interest, the vector can be packaged and vector virion 
produced. The vector virions produced by tiiis method can tiasa be used to infect a tissue 
cell line, such as NIH 3T3 cells, to produce large quantities of chimoic retroviral virions. 

Another targeted delivoy system for polynucleotides encoding zinc finger derived-DNA 
binding polypeptides is a colloidal dispersion system. Colloidal dispersion systems 

1 5 include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based 
systems including oil-in-wato' emulsions, micelles, mixed micelles, and liposomes. The 
preferred colloidal system of this invoition is a liposome. Liposomes axe artificial 
membranevesicles\^diareusefulasdelivay vehicles ^vi/ro and in vivo. It has been 
shown that large unilamellar vesicles (LUV), vUdch range in size fix>m 0.2-4.0 um can 

20 encapsulate a substantial percentage of an aqueous buffer containing large 
maoromolecules. RNA, DNA and intact virions can be encapsulated within the aqueous 
interior and be delivered to cells in a biologically active form (Fraley, et al.. Trends 
Biochem. ScL, fi:77, 1981). In addition to mammalian cells, liposomes have been used 
for delivery of polynucleotides in plant, yeast and bact^ial cells. In order for a liposome 

25 to be an efGcient gene transfer vehicle, the following characteristics should be present: 
(1 ) enc£^)sulation of the genes of inta«st at high efGciency while not compromising their 
biological activity; (2) preferential and substantial binding to a target cell in comparison 
to non-target cells; (3) delivery of the aqiieous contents of the vesicle to the target cell 
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cytoplasm at high efficiency; and (4) accurate and effective e^qjression of genetic 
information (Mannino, etal. Biotechniques, ^:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, particularly 
high-phase-transition-temperature phospholipids, usually in combination with steroids, 
5 especially cholesterol. Other phospholipids or other lipids may also be used. The 
physical characteristics of liposomes dqiend on pH, ionic strength, and the presence of 
divalent cations. 

Examples of lipids useful in liposome production include phosphati^l compounds, such 
as phosphatidylglycerol, phosphatidylcholine, phosphatidylserine, phosphatidyletha- 
10 nolamine, sphingolipids, cerebrosides, and gangliosides. Particularly useful are 
diacyiphosphatidylglycerols, vdiere the lipid moiety contains fiom 14-18 carbon atoms, 
particularly finm 16-18 carbon atoms, and is saturated. Illustndve phospholipids include 
egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
distearoylphosphatidylcholine. 

1 5 The targeting of liposomes has been classified based on anatomical and mechanistic 
factors. Anatomical classification is based on the level of selectivity, fat example, organ- 
specific, cell-spedfic, and organelle-specific. Medianistic targeting can be distinguished 
based u^n v^ethra- it is passive or active. Passive targeting utilizes tfie natural tendency 
of liposomes to distribute to cells of the reticulo-endothelial system (RES) in organs 

20 M^ch contain sinusoidal «q)illaries. Active targeting, on the other hand, involves 
alteration of the liposome by coupling the liposome to a specific ligand such as a 
monoclonal antibody, sugar, glycolipid, or protein, or by changing ^ composition or size 
of the liposome in order to achieve targeting to organs and cell types other than the 
naturally occurring sites of localization. 

25 The surface of the targeted delivery Systran may be modified in a variety of ways. In the 
case of a liposomal targeted delivery system, lipid groups can be incorporated into the 
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lipid bilayo: of the liposome in order to maintain the targeting iigand in stable association 
with the liposomal bilayer. Various linking groups can be used for joining the lipid 
chains to the targeting ligand. 

In general, the compounds boimd to the surface of the targeted delivery system will be 
5 ligands and receptors whidi will allow the targeted delivery system to find and "home in" 
on the desired cells. A ligand may be any compound of interest which will bind to 
another compound, such as a receptor. 

In general, surface membrane proteins which bind to sp)ecific effector molecules are 
referred to as receptors. In fbs present invraition, antibodies are preferred receptors. 
Antibodies can be used to target liposomes to specific cell-sur&ce ligands. For example, 
certain antigens expressed specifically on tumor cells, referred to as tumor-associated 
antigens (TAAs), may be exploited for the purpose of targeting antibody-zinc finger- 
nucleotide binding protein-containing liposomes directly to the mali gnant tumor. Since 
the zinc finger-nucleotide binding protein gene product may be indiscriminate with 
respect to cell type in its action, a targeted delivoy system offers a significant 
improvemrait over randomly injecting non-specific liposomes. A number of procedures 
can be used to covalently attach either polyclonal or monoclonal antibodies to a liposome 
bilayer. Antibody-targeted liposomes can include monoclonal or polyclonal antibodies 
or fir^ments thereof such as Fab, or F(ab')2> as long as tiiey bmd efBciently to an the 
antigenic epitope on the target cells. Liposomes may also be targeted to cells expressing 
receptors for hormones or other serum fitctors. 

In another embodiment, the invention provides a method for obtaining an isolated zinc 
finger-n\icleotide binding polypeptide variant which binds to a cellular nucleotide 
sequence comprising, first, identifying the amino acids in a anc finger-nucleotide binding 
25 polypeptide ihat bind to a first cellular nucleotide sequence and modulate the function of 
the nucleotide sequence. Second, an expression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified in the first step is 
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created. Third, the library is expressed in a suitable host cell, vAdch will be apparent to 
those of skill in the art, and finally, a clone is isolated that produces a polypqitide variant 
that binds to a second cellular nucleotide sequence and modulates the function of the 
second nucleotide sequence. The invention also includes a zinc finger-nucleotide binding 
5 polypeptide variant produced by the method described above. 

Preferably, a phage sur&ce e^qiression system, as described in the Examples of the 
present disclosure, is utilized as the library. The phage library is treated with a reducing 
reagent, such as dithiothreitol, which allows proper folding of the expression product on 
the phage surface. The library is made firom polynucleotide sequences which encode a 

10 zinc finger-nucleotide binding polypeptide variant and vMch have been randomized, 
preferably by PCR using primers containing degenerate triplet codons at sequence 
locations corresponding to the determined amino acids in the first step of the method. 
The degenerate triplet codons have the formula NNS or NNK, wherein S is either G or 
C, K is either G or T, and N is independently selected fix>m the groiq) consisting of A, C, 

15 G,orT. 

The modulation of the function of the cellular nucleotide sequence includes the 
enhancement or suppression of transcription of a gene operatively linked to tiie cellular 
nucleotide sequence, particularly v^en the nucleotide sequence is a promoter. The 
modulation also includes suppression of transcription of a nucleotide sequence y^ch is 
20 within a structural gene or a virus DNA or RNA sequence. Modulation also includes 
inhibition of translation of a messenger RNA. 

In addition, the invention discloses a method of treatmg a cell proliferative disorder, by 
the ex vivo introduction of a recombinant expression vector comprising the polynucleotide 
encoding a zinc finger-nucleotide binding polypeptide into a cell to modulate in a cell the 
25 function ofa nucleotide sequence comprising a zinc finger-nucleotide binding motif. The 
cell proliferative disorder comprises those disorders as described above which are 
typically associated with transcription of a gene at reduced or increased levels. The 
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method of the invention oflFers a technique for modulating such gene ejqiression, whether 
at the promoter, structural gene, or RNA level. The method includes the removal of a 
tissue sample fiom a subject with the disorder, isolating hematopoietic or other cells from 
the tissue sample, and contacting isolated cells with a recombinant expression vector 
5 containing the DNA encoding zinc finger-nucleotide binding inotein and, optionally, a 
target specific gene. Optionally, the cells can be treated with a growth factor, such as 
iiiterleukin-2 for example, to stimulate cell growtii, before reintroducing the cells into the 
subject. When reintroduced, the cells will specifically target the cell population from 
v»Wchthqr were originally isolated. In this way, the trans-repressing activity of the zinc 
1 0 finger-nucleotide binding polypeptide may be used to inhibit or siqipress undesirable cell 
proliferation in a subject In certain cases, modulation of the nucleotide sequence in a cell 
refers to suiqnession or oibancement of the transcription of a graie operatively linked to 
a cellular niicleotide sequence. Preferably, the subject is a human. 

An alternative use for recombinant retroviral vectors comprises the introduction of 
1 5 polynucleotide sequences into the host by means of skin transplants of cells containing 
the virus. Long tenn ejqnession of foreign genes in implants, using cells of fibroblast 
origin, may be achieved if a strong housekeeping graie promoter is used to drive 
transcription. For example, the dihydrofolate reductase (DHFR) gene promoter may be 
used. Cells such as fibroblasts, can be infected with virions containing a retroviral 
20 construct containing flie gene of intaest, fm example a truncated and/or mu t a geniz ed zinc 
finger-nucleotide binding protein, together with a gene which allows for specific 
targeting, sudi as tumor-assodated antigoi (TAA), and a strong promoter. The infected 
cells can be embedded in a collagen matrix vAach can be grafted into the connective 
tissue ofthedomis in the recipient subject As tiie retrovirus jMolifiaates and esc^jes the 
25 matrix it will specifically infect the target cell population. In this way the transplantation 
results in increased amounts of trans-repressing zinc finger-nucleotide binding 
polypeptide being produced in cells manifesting the cell proliferative disorder. 
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The novel zinc finger-nucleotide binding proteins of the invention, which modulate 
transcriptional activation or translation either at the promoter, structural gene, or RNA 
level, could be used in plant species as well. Transgenic plants would be produced such 
that the plant is resistant to particular bacterial or viral pathogens, for example. Methods 
5 for transferring and e^qxressing nucleic acids in plants aie well known in the art (See for 
example, Hiatt, et al., U.S. Patent No. 532,422, incorporated herein by reference.) 

In a further embodiment, the invention provides a method for identifying a modulating 
polypeptide derived fiom a zinc finga--niicleotide binding polypeptide that binds to a zinc 
finger-nucleotide binding motif of intoest comprising incubating components, 
10 comprising a nucleotide sequence encoding the putative modulating protein operably 
linked to a first inducible promoter and a reporter gene operably linked to a second 
inducible promoter and a anc finger-nucleotide binding moti^ wherein the incubating is 
carried out under conditions sufGci^it to allow the components to interact, and measuring 
the effect of the putative modulating protein on the expression of the reporter gene. 

15 The term "modulating" divisions the inhibition or suppression of expression fiom a 
promoter containing a zmc finger-nucleotide binding motif v/bea it is over-activated, or 
augmentation or enhancement of expression fipom such a promoter v^en it is imder- 
activated. A first inducible i»omoter, sudi as the arabinose promoter, is operably linked 
to tbs nucleotide sequence encoding tiie putative modulating polypeptide. A second 

20 inducible promoter, such as the lactose promoter, is operably linked to a anc finger 
derived-DNA binding motif followed by a reporter gene, such as p-galactosidase. 
Incubation of the components may be in vitro or in vivo. In vivo incubation may include 
prokaiyotic or eukaryotic systems, such as Kcoli or COS cells, respectively. Conditions 
vAach. allow the assay to proceed include incubation in the presence of a substance, such 

25 as axabinose and lactose, which activate the first and second inducible promoters, 
respectively, thereby allowing expression of the nucleotide sequence encoding the 
putative trans-modulating protein nucleotide sequence. Whether or not the putative 
modulating protein binds to the zinc finger-nucleotide binding motif which is operably 
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linked to the second inducible promoter, and affects its activity is measured by the 
expression of the reporter gene. For example, if the reporter gene was p-galactosidase, 
the presence of blue or vAdts plaques would indicate v^ether the putative modulating 
protein enhances or inhibits, respectively, gene expression from the promoter. Other 
5 commonly used assays to assess the fimction fiom a promoter, including chloramphenicol 
acetyl transferase (CAT) assay, will be known to those of skill in the art Both prokaryote 
and eukaryote systems can be utilized. 

The invention is useiiil for the identification of a novel zinc finger-nucleotide binding 
polypeptide derivative or variant and the nucleotide sequence encoding the polypeptide. 

1 0 The method entails modification of the fingers of a wild type zinc finger protein so that 
th^ recognize a nucleotide, either DNA or KNA, sequence other than the sequence 
originally recognized by that protein. For example, it may be desirable to modify a 
known zinc finger protein to produce a new anc finger-nucleotide binding poljrpeptide 
that recognizes, binds to, and inactivates the promoter region (LTR) of human immunode- 

15 ficiency virus (HTV). Following identification of the protein, a truncated form of the 
protein is produced that represses transcription normally activated fix>m&at site. InHIV, 
the target site for a anc finga--nucleotide binding motif within tiie promoter is CTG- 
TTG-TGT. The three fingers of zi^8, for example, are mutagaiized, as described in the 
examples. The fingers are mutagenized independently on the same protein (one by one), 

20 or independently or "piecewise" on three different zif268 molecules and religated after 
being mutagenized. Although one of these two methods is preferable, an alternative 
method would allow the three fingers to be mutagoiized simultaneously. After 
miitagenesis, a phage display library is constructed and screened with the appropnate 
oligoniicleotides vAach include the binding site of interest If the fingers were 

25 mutagenized independently on the same protein, sequential libraries are constructed and 
panning performed after each library construction. For example, in zif268, a finger 3 
library is constructed and panned wifli a finger 3 specific oligo; the positive clones from 
this screen are collected and utilized to make a finger 2 library (using finger 3 library 
DNA as a tonplate); panning is performed with a finger 32 specific oligo; DNA is 
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collected from positive clones and used as a template for finger 1 library construction; 
finally selection for a protein >vith 3 new fingers is performed with a finger 321 specific 
oligo. The method results in identification of a new zinc finger derived-DNA binding 
protein that recognizes, binds to, and represses transcription bom the HTV promoter. 
5 Subsequent truncation, mutation, or e:q}ansion of various fingers of the new protein 
would result in a protein which represses transcription fix)m the HTV promoter. 

The invention provides, in EXAMPLES 7-13, an illustration of modification of Zif268 
as described above. Therefore, in another embodiment, the invention provides a novel 
zinc-finger-nucleotide binding polypeptide variant comprising at least two zinc finger 
1 0 modules that bind to an HTV sequence and modulates the fimction of the HIV sequence, 
for example, the HIV promoter sequence. 

The identification of novel zinc finger-nucleotide binding proteins allows modulation of 
gaae ejqnession from promoters to vMch these proteins bind. For example, v/hea a cell 
prolifoative disorder is associated with overactivation of a promoter vAdch contains a 

15 zinc finger-nucleotide binding motif, such suppressive reagents as antisense 
polynucleotide sequence or binding antibody can be introduced to a cell, as an alternative 
to flie addition ofa zinc finger-nucleotide binding protein derivative. Alternatively, ^^irai 
a cell proliferative disorder is associated with underactivation of the promoter, a sense 
polynucleotide sequence (the DNA coding strand) or zinc fingerniucleotide binding 

20 polypeptide can be introduced into the cell. 

Minor modifications of the primary amino acid sequence may result in proteins vAuch 
have substantially equivalent activity compared to the zinc finger derived-binding protein 
described herein. Such modifications may be deliberate, as by site-directed mutagenesis, 
or may be spontaneous. All proteins produced by these modifications are included herein 
25 as long as zinc finger-nucleotide binding protein activity exists. 
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In another embodiment, zinc finger proteins of the invention can be manipulated to 
recognize and bind to extended target sequences. For example, zinc finger proteins 
containing &om about 2 to 20 zinc fingers Zif(2) to Zif(20), and preferably Smm about 
2 to 12 zinc fingers, may be fiised to the leucine zipper domains of the Jun/Fos proteins, 
5 prototypical members of the bZIP family of proteins (O'Shea, et al. Science, 254:539. 
1 99 1 ). Alternatively, zinc finger proteins can be fiised to other proteins \siiich are capable 
offomiingheterodimers and contain dimerization domains. Such proteins will be known 
to those of skill in the art 

The Jun/Fos leucine zippers are described for illustrative purposes and prefaenfially form 

1 0 heterodimers and allow for the recognition of 12 to 72 base pairs. Henceforth, Jun/Fos 
refer to the leucine zipper domains of these proteins. Zinc finger proteins are fiised to 
Jun, and independently to Fos by methods commonly used in the art to link proteins. 
Following purification, the 2^-Jun and Zif-Fos constructs (SEQ ID NOS: 33, 34 and 35, 
36 respectively), die proteins are mixed to spontaneously form a Zif-Jun/Zif-Fos 

1 5 heterodhner. Alternatively, coexpression of the genes encoding these proteins results in 
file formation ofZif-Jun/Zif-Fos heterodimers in vrvo. Fusion offiieheterodimer with an 
N-terminal nuclear localization signal allows for targeting of ejqnession to the nucleus 
(Calderon, et al. Cell, 41:499, 1982). Activation domains may also be incorporated into 
one or eadi of the leucine zipper fiision constructs to produce activators of transcription 

20 (Sadow^ et aL. Gene, 118:137. 1992). Tliese dimeric constructs then allow for specific 
activation or repression of transcription. These heterodimeric Zif constructs are 
advantageous since Haey allow for recognition of palindromic sequences (if the fingers on 
bofii Jun and Fos recognize fiie same DNA/RNA sequence) or extended asymmetric 
sequences Of fiie fingers on Jun and Fos recognize different DNA/RNA sequences). For 

25 example the palindromic sequence 



5' 
3 ' 
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is recognized by the Zi£268-Fos/Zif268 Jun dimer (x is any number). The spacing 
between subsites is detennined by the site of fusion of Zif with the Jun or Fos zipper 
domains and the length of the linker between the Zif and zipper domains. Subsite spacing 
is detemiined by a binding site selection method as is common to those skilled in the art 
5 (Thiesen, et al., Nucleic Acids Research, lfi:3203, 1 990). Example of the recognition of 
an extended asynunetiic sequence is shown by Zif(C7) ^Jun/Zif-268-Fos dimer. This 
protein consists of 6 fingers of the C7 type (EXAMPLE 1 1) linked to Jun and three 
fingers of Zi£268 linked to Fos, and recognizes the extended sequence: 

5' - CGC CGC CGC CGC CGC CGC |n| GCG TGG GCG - 3' 
10 3' - GCG GCG GCG GCG GCG GCG Wx CGC ACC CGC - 5' 

(SEQIDNO: 38) 



Oxidadve or hydrolytic cleavage of DNA or KNA with metal chelate complexes can be 
performed by methods known to those skilled in the ait In another embodiment, 
attachment of chelating groiqis to Zif proteins is preferably facilitated by the 

1 5 incorporation of a Cysteine (Cys) residue between the initial Methionine (Met) and the 
first Tyrosme (Tyr) of the protein. The Cys is then alkylated with chelators known to 
those skilled in the art, for example, EDTA derivatives as described (Sigman, 
Biochemistry, 22:9097, 1990). Alternatively the sequence Gly-Gly-His can be made as 
tiie most amino terminal residues since an amino traminus composed of the residues has 

20 been described to chelate Ctt+2 (Mack, et al.. J. Am. Chem. Soc. llfl:7572, 1988). 
Preferred metal ions include Cu+2, Ce^-3 (Takasaki and Chin, J. Am. Chem. Soc, 
116:1 121, 1994) ZiH-2, Cch-2, PlH-2, Fe*2 (Sdmaith, et al., Proc. Natl. Acad Sci. USA, 
21:569, 1 994), Fe4-3,Ni+2,Ni+3,Laf 3, Eu+3 (Hall, croi. Chemistry and Biology, liUS, 
1994), Gd+3, Tb+3, LiH-3 Mn+2, Mg+2. Cleavage with chelated metals is generally 

25 performed in the presence of oxidizing agents such as O2, hydrogen peroxide H2O2 and 
reducing agents such as thiols and ascorbate. The site and strand (+ or - site) of cleavage 
is determined empirically (Mack, et al, J. Am. Chem. Soc., ilQ:7572, 1988) and is 
dependent on the position of the Cys between the Met and the Tyr preceding the first 
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finger. In the protein Met (AA) Tyr-(Zif),.i2, the chelate becomes Met-(AA),, Cys- 
Chelate-(AA);a-Tyr-(Zif),.,2, where AA = any amino acid and x = the number of amino 
acids. Dimetic af constructs of the type Zif-Jun/Zif-Fos are preferred for cleavage at two 
sites within the target oligonucleotide or at a single long target site. In the case where 
5 double stranded cleavage is desired, both Jun and Fos containing proteins are labelled 
with chelators and cleavage is performed by methods known to those skilled in fte act 
In this case, a staggered double-stranded cut analogous to tiiat produced by restriction 
en^mes is generated. 

Following mutagraiesis and selection of variants of the ZiflSi protein in vMch the finger 
10 1 specificity or afOnity is modified, proteins carrying multiple copies of the finger may 
be constructed using the TGEKP linker sequence by methods known in the art For 
example, ^ C7 finger may be constructed according to the scheme: 

MKLLEPYACPVESCDRRFSKSADLKRHIRHIQEKE- 

(YACPVESCDRRFSKSADLKHIRIHIQEE^ i-ii, (SEQ IDNO: 39) where the sequence 
15 oftiie last linker is sutgect to change since it is at the toininm and not involved in linkii^ 
two fingers together. Tins protein binds the designed target sequence GCG-GCG-GCG 
(SEQ IDNO: 32) in the oHgonudeotide hairpin CCT-CGC-CGC-CGC-GGG-TTT-TCC- 
CGC-GCC-CCC GAG G (SEQ ID NO: 40) witii an afiSnity of 9nM, as compared to an 
afiBnity of 300 nM for an oligonucleotide encoding the GCG-TGG-GCG sequence (as 
20 detemiined by sur&ce plasmon resonance studies). Fingers utilized need not be identical 
and may be mixed and matched to produce proteins which recognize a desired target 
sequence. These may also be utilized with leucine zippers (e.g., Fos/Jim) or other 
heterodimers to produce proteins with extended sequence recognition. 

In addition to producing polymers of finger 1, tiie aitire three finger Zif268 and modified 
25 versions tiiffltein may be fused using the consensus linker TGEKP to produce proteins with 
extended recognition sites. For example, the protein ZiQ68-Zif268 can be produced in 
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vMch the natural protein has been fused to itself using the TGEKP linker. This protein 
now binds the sequence GCG-TGG-GCG-GCG-TGG-GCG. Therefore modifications, 
within the three fingers of Zif268 or other zmc finger proteins known in the art may be 
fiised together to form a protein w^iich recogm2es extended sequences. These new zinc 
5 proteins may also be used in combination with leucine zippers if desired. 

The invention now being fully described, it will be {^iparent to one of ordinary skill in fte 
art that various changes and modifications can be made without departing fix)m the spirit 
or scope of the invention. 



wo 95/19431 



PCr/US95rt»0829 



-53- 
EXAMPLES 

A recombinant polypeptide containing three of nine of the TFlllA zinc fingers (Clemens, 
et al, Proc. Natl Acad. Sci.. USA. 52:10822, 1992) has been generated by polymerase 
chain reaction (PCR) amplification fit)m the cDNA for TFUIA and expression in E. coli. 
5 The recombinant protein, termed zfl -3, was purified by ion exchange chromatography 
and its binding site within the SS gene was deteimined by a combination of DNase I 
footprinting and landing to synthetic oligonucleotides (Liao, et al., J. Mol. Biol, ^1:857, 
1992). Hie examples provide experiments \^ch show that the binding of this 
polypeptide to its recognition sequence placed close to an active RNA polymerase 

10 promoter could inhibit the activity of tiiat promoter in W<ro. To provide such a test 
system, a 26 bp oligonucleotide containing the 13 bp recognition sequence for zfl -3 was 
cloned into ib& polylinker region of plasmid pUC19 near the promoter sequence for T7 
RNA polymerase. Hie DNA binding activity of our preparation of recombinant zfl -3 was 
determined by gel mobility shift analysis with fbe oligonucleotide containing the binding 

1 5 site. In addition, in vitro transcription was performed with T7 RNA polymerase in the 
presence or absence of the same amounts of tiie zfl-3 polypeptide used in the DNA 
binding titration. For each DNA molecule bound by zfl -3, that DNA molecule is 
rendered inactive in transcription. In fliese examples, dierefore, a zinc finger polypeptide 
has been produced vMch fully blocked die activity of a promoter by binding to a nearby 

20 target sequence. 



wo 95/19431 



PCT/US95/00829 



-54- 



EXAMPLEl 
SEQUENCE-SPECIF IC CENE TARGETING 
BY ZINC FINGER PROTEINS 

A. From the crystal stiucture of zi£268, it is clear that specific histidine (non-zinc 
5 coordinating his residues) and arginine residues on the surface of the a-helix, the finger 
tip, and at helix positions 2, 3, and 6 (immediately preceding the conserved histidine) 
participate in hydrogen bonding to DNA guanines. As the number of structures of zinc 
iBnger complexes continues to increase, it will be likely that different amino acids and 
different positions may participate in base specific recognition. FIGURE 2 (panel A) 
10 shows the sequmceofthetiireeamino-teriimial fingers ofTI^niA with basic aniino acids 
at these positions underiined. Similar to finger 2 of the regulatory protein ziQ68 (Krox- 
20) and fingers 1 and 3 of Spl , finger 2 of TFIIIA contains histidine and aiginine residues 
at these DNA contact positions; further, each of these zinc fingers minimally recognizes 
the sequence GGG (FIGURE 2, panel B) within the 5S gene promoter. 

15 A recombinant polypeptide containing these three TFIIIA anc fingers has been generated 
by polymerase chain reaction (PGR) amplification fmm the cDNA for TFIIIA and 
e}q)ression in R coli (Clonens, et al., siqjra). An experiment was designed to determine 
whether the binding of this polypeptide to its recognition sequence, placed close to an 
active RNA polymerase promoter, would inhibit the activity of tiiat promoter in vitro. 

20 The following expoimmts were done to provide such a test system. A 23 bp 
oligonucleotide (Liao, etal. 1992, supra) containing the 13 bp recognition sequence for 
zfl-3 was cloned into the polylinker region of plasmid pBluescript SK+ (Stratagene, La 
JoUa, CA), near fte promoto* sequence for T7 RNA polymoase. Hie parent plasmid was 
digested with the restriction enzyme £coRV and, after dephosphorylation with calf 

25 intestinal alkaline phosphatase, the phosphorylated 23 bp oligonucleotide was inserted by 
ligation with T4 DNA ligase. The ligation product was used for transformation of DH5o 
E. coli cells. Clones harboring 23 bp inserts were identified by restriction digestion of 
miniprep DNA. The success of cloning was also verified by DNA sequence analysis. 
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Thc DNA binding activity of the preparation of recombinant zfl-3 was also determined 
by gel mobility shift analysis with a 56 bp radiolabeled EcoRI/XhoI restriction fragment 
doived fiom the cone containing the binding site for zfl-3 and with the radiolabeled 23 
bp oligonucleotide. Gel shift assays were done as described (Liao, et al, supra; Fried, et 
5 al, Nucl. Acids., Res., £:6505, 1981). The result of the latter analysis is shown in 
FIGURE 3. Binding reactions (20 /A) also contained 1 fxg of unlabeled plasmid DNA 
harboring the same 23 bp sequence. In lanes 2-12, the indicated amounts of zfl-3 were 
also included in the reactions. After incubation at ambient temperature for 30 min, the 
samples were subjected to electrophoresis on a 6% nondenaturing polyaciylamide gel in 

1 0 88mM Tris-borate, pH 83, buffer. In each reaction, a trace amount of the radiolabeled 
oligonucleotide was used with a constant amount (1 (ig) of plasmid DNA harboring the 
zfl-3 Unding site. Hie reactions of lanes 2-12 contained increasing amounts of the zfl-3 
polypeptide. Theautoradiogramof^gel isshown. The results indicate that binding of 
zfl-3 to the radiolabeled DNA caused a retardation of electrophoretic mobility. The 

1 5 percentage of radiolabeled DNA molecules bound by zfl-3 also reflects die percentage 
of unlabeled plasmid DNA molecules boimd. 

In vitro transcription experiments were performed with T7 RNA polymerase in the 
presence or absence of the same amounts of the zfl-3 polypeptide used in the DNA 
binding titration wifli idoitical amounts of Ae plasmid DNA haiboring the zfl-3 binding 

20 site. Each reaction contained, in a volume of 25 /J, 1 ng of i*vi/II-digested pBluescript 
SK+DNA contaming the 23 bp bmding site for zfl-3 inserted in the EcoRW site of the 
vector, 40 units of RNasin, 0.6 mM ATP+UTP-fCTP, 20 (M GTP and 10 (j.Ci of o-^^- 
GTP and 10 units of T7 RNA polymerase (Stratagraie). The reaction buflFer was provided 
by Stiatagene. After incubation at 37"C for 1 hour, the products of transcription were 

25 purified by phenol extraction, concentrated by ethanol precipitation and analyzed on a 
denaturing polyacrylamide gel. T7 transcription was monitored by the incorporation of 
radioactive nucleotides into a run-off transcript FIGURE 4 shows an autoradiogram of 
a denaturing polyacrylamide gel analysis of the transcription products obtained. In this 
experiment, the plasmid DNA was cleaved with the restriction enzyme PvuU and the 
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expected length of the run-off transcript v/as 245 bases. Addition of zfl-3 polypeptide 
to the reaction repressed transcription by T7 RNA polymerase. 

FIGURE 5 shows a gr^h in vAdch the percentage of DNA molecules bound by zfl -3 in 
the DNA gel mobility shift assay (x-axis) versus the percoitage of inhibition of T7 KNA 
5 polymerase transcription by the same amounts of zfl -3 (y-axis) has been plotted. Note 
that each data point corresponds to identical amoimts of zfl -3 used in tiie two assays. The 
one-to-one correspondence of the two data sets is unequivocal. T7 transcription was 
monitored by the incorporation of radioactive nucleotides into a run-off transcript 
Transcription was quantitated by gel electrophoresis, autoradiography and densitometry. 
10 Gel mobility shift assays were quantitated in a similar feshion. For each DNA molectile 
boimd by zfl-3, &at DNA molecule is lendoped inactive in transcription. In this 
experiment, therefore, a zinc finger polypeptide has fully blocked the activity of a 
promoter by binding to a nearby target sequence, 

B. Since the previous experiment was performed with a prokaryotic RNA polymerase, 
15 ibs following experiment was performed to determine ^K^ietho' the zinc finger polypeptide 
zfl-3 could also block the activity of a eukaiyotic RNA polymerase. To test this, a 
transcription extract prepared fiiom unfertilized Xenopus eggs (Hartl, et al., J. Cell Biol., 
I2Q:613, 1993) and the Xenopus SS RNA gent template was used. These extracts are 
highly active in transcription of SS RNA and tRNAs by RNA polymraase m. As a test 
20 tenq>late, the SS RNA gene v4iich naturally contains Ae binding sites for TFIHA and zfl - 
3, was used. Each reaction contained 10 /il of a high speed supernatant of tiEte egg 
homogenate, 9 ng of TFEdA, nucleoside triphosphates (ATP, DTP, CTP) at 0.6 mM and 
lOAtCi of a-^-GTP and GTP at 20 in a 2S m1 reaction. All reactions contained 1 80 
ng of a plasmid DNA harboring a single copy of tiie Xenopus somatic-type SS RNA gene, 
25 and the reactions of lanes 2 and 3 also contained 300 ng of a Xenopus tRNAmet gene- 
containing plasmid. Prior to addition of the Xenopus egg extract and I t'UlA, 0.2 and 0.4 
fj.g of zfl-3 were added to the reactions of lanes 2 and 3, req)ectively. The amount of zfl- 
3 used in the experiment of lane 2 was sufBcient to bind all of the SS gene-containing 
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DNA in a sq)arate binding reaction. Afto* a IS min. incubation to allow binding of zfl-S 
to its recognition sequence, the other reaction components were added. After a 2 hour 
incubation, the products of transcription were purified by phenol extraction, concentrated 
by ethanol precipitation and analyzed on a denaturing polyacrylamide gel. The 
5 autoradiogram is shown in FIGURE 6. FIGURE 6 also shows the result of a controlled 
reaction in which no zinc finger protein was added (lane I). As a control, lanes 2 and 3 
also contained a tRNA gene template, v/ladi lacks the binding site for TFIIIA and zfl-3. 
SS RNA transcription was rqniessed by zfl -3 while tRNA transcription was imaffected. 
These results demonstrate that zfl-3 blodcs the assembly of a eukaryotic RNA 
10 polymerase m transcription complex and shows that this effect is specific for DNA 
molecules that harbor the binding site for the recombinant zinc finger protein derived 
from TFIIIA. 

Three-dimensional solution structures have been determined for a protein containing the 
first tiiree 2anc fingers of TFIIIA using 2D, 3D, and 4D NMR metiiods. For this purpose, 

15 the protein was expressed and purified from E. coli and uniformly labeled with and 
The NMR structure shows that tiie individual zinc fingers fold into the canonical 
finger structure with a small p-^eet packed against an a-helix. The fingers are not 
entirely indepoident in solution but thoe is evidraice of subtie interactions between tiiem. 
Using similar techniques the 3D structure of a complex between zf 1-3 and a 13bp 

20 oligonucleotide corresponding to its specific binding site on the SS RNA gene is 
determined and used to provide essential information on tiie molecular basis for sequence- 
specific nucleotide recognition by the TFIIIA zinc fingers. This information is in turn 
used in designing new zinc finger derived-nucleotide binding proteins for regulating the 
preselected target genes. Similar NMR methods can be applied to determine the detailed 

25 stiuctures of the conc^lexes formed between designed zuic finger proteins and their target 
genes as part of a structure-based ^proach to refine target gene selectivity and enhance 
binding aftinity. 
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EXAMPLE2 

ISOLATION OF NOVEL ZINC FINGER-NUCLEOTIDE 
BINDING PROTEINS 



In order to rapidly sort large libraries of zinc finger variants, a phage surfoce display 
5 system initially developed for antibody Ubraries (Baibas, et al.. METHODS. 2:1 19, 1991) 
was used. To this, end, pComb3 has been modified for zinc finger selection. The 
antibody light chain promoter and cloning sequences have been removed to produce a 
new vector, pComb3.S. The zi£268 three finger protein has be«i modified by PCR and 
inserted into pComb3.5. The zinc fingers are functionally displayed on ibs phage as 

1 0 detemiined by solid phase assays v^di demonstrate ^t phage bind DNA in a sequence 
dependent fiishion. Site-directed mutagenesis has been performed to insert an Nsil site 
between fingers 1 and 2 in order to fecilitate library construction. Furthermore, ziC68 
is functional vibea fiised to a decapepdde tag vHuch allows its binding to be conveniendy 
monitored. An initial library has been constructed using overlap PCR (Barbas, et al., 

1 5 Proc. Natl. Acad Sci. USA, 22:4457, 1992) to create finger 3 variants vAisn 6 residues 
on die amino terminal side of the a helix involved in recognition were varied with an 
NNK doping strategy to provide deg^ioacy. Tliis tiiinl fingo* originally bound the GCG 
3 bp subsite. Selection for binding to an AAA subsite revealed a consensus patton 
appearing in tiie selected sequences. 
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The ziC68 containing plasmid, pZif89 (Pavletich, et al. Science 252:809, 1991), was 
used as the source of ziG68 DNA for modification of the zinc fingers. Briefly, pZifB9 
was cloned into die plasmid, pComb3.S, after amplification by PCR using the following 
primers: 

5 ZF: 5'-ATG AAA CTG CTC GAG CCC TAT GCT TGC CCT GTC GAG-3' 
(SEQUENCE ID NO. 2) 

ZR: 5'-GAG GAG GAG GAG ACT AGT GTC CTT CTG TCT TAA ATG GAT TTT 
GGT-3' (SEQUENCE E) NO. 3). 

The PCR reaction was performed in a 100/zl reaction containing 1 12% of each of 
oligonucleotide primers ZF and ZR, dNTPs (dATP, dCTP, dGTP, dTTP), 1 .5mM MgCla 
Taq polymerase (5 units) 10 ng template pZif89, and 10|d 10 x PCR buffer (Perkin - 
Ehner Corp.). Hiirty rounds of PCR amplification in a Peikin - Elmer Cetus 9600 Gene 
Amp PCR system tfiermocycler were performed. The amplification cycle consisted of 
denaturing at 94°C for one minute, annealing at 54''C for one minute, followed by 
extrosion at ll'C for two minutes. The resultant PCR amplification products vi&K gel 
purified as described below and digested widi XhoVSpel and ligated into pComb3.S. 
pComb3.5 is a variant of pComb3 (Baibas, et al. Proc. Natl. Acad. Scl, USA. M:7978, 
1991) which has the light diain region, include ig its lacZ promoter, removed. Briefly, 
pComb3 was digested with Nhel, klenow treated, digested with Xbal, and religated to 
form pComb33. O&er similar vectors v^ch could be used in place of pComb3.S, sudi 
as Surf Ziq)™ (Stratagene, La Jolla, CA), will be known to those of skill in the aft 

The phagemid pComb3.S containing ziQ68 was then used in PCR amplifications as 
described herein to introduce nucleotide substitutions into the zinc fingers of zif268, to 
produce novel zinc fingers wdiich bind to specific recognition sequences and which 
25 enhance or repress transcription after binding to a given promoter sequence. 
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The methods of producing novel zinc fingers with particular sequence recognition 
specificity and regulation of gene expression capabilities involved the following steps: 

A first zinc finger (e.g.. Zinc finger 3 of zif268) was first randomized through the 
use of overlap PGR; 

Amplification products fit>m the overlap PGR containing randomized zinc fingers 
were ligated back into pComb3.S to form a randomized library; 

Following expression of bacteriophage coat protein ID-anchored zinc finger fit>m 
ihs library, the sur&ce protein ejqnessing phage were paimed against specific zinc 
finger recognition sequences, resulting in the selection of several specific 
randomized zinc fingers; and 

Following selection of sequence-specific zinc fingers, the corresponding phagemids 
were sequenced and the amino acid residue sequence was derived therefinm. 

EXAMPLE 3 

PREPARATION OF RANDOMIZED ZINC FINGERS 

15 To randomize tiieancfingasofzi£Z68 in pComb3.S, described above, two separate PGR 
amplifications were performed for each finger as described herein, followed by a third 
ovedsp PGR amplification tiiat resulted in the annealing of the two previoiis amplification 
products, followed by a tiiird amplification. The nucleotide sequence of zinc finga of 
af268 of template pGomb3.5 is shown in nOURE 7 and is listed in SEQUENGE ID NO. 

20 4. The nucleotide positions tiiat were randomized in zinc finger 3 began at nucleotide 
position 217 and ended at position 237, excluding serine. The template zi068 sequence 
at that specified site encoded eight total amino acid residues in Gnger 3. This amino acid 
residue sequence of finger 3 in pComb3.S which was to be modified is ^£g-Ser-ASE-£ilu- 
Affi-Lxs-Alg-His (SEQUENGE ID NO. 5). The underiined amino acids represent those 



5 2. 
3. 

10 

4. 
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residues which were randomized. 

A pool of oligonucleotides vMch included degenerate oligonucleotide primers, 
designated BZF3 and ZF36K. and non-degenerate primers R3B and FTX3 having the 
nucleotide formula described below, (synthesized by Operon Technologies, Alameda, 
5 CA), were used for randomizing the zinc finger 3 of ziC68 in pCombS .5 . The six triplet 
codons for introducing randomized nucleotides included the repeating sequence NNM 
(complement of NNK), v/bste M can be either G or C and N can be A, C, G or T. 

The first PGR an^lification resulted in the amplification of the S' region of the zinc finger 
3 fi:agment in the pComb3.S phagemid vector clone. To amplify this region, &e 

10 following primer pairs were used. The S' oligonucleotide primer, FDG, having the 
nucleotide sequence 5'-GCA ATT AAC CCT CAC TAA AGG G-3" (SEQUENCE ID 
NO. 6), hybridized to the noncoding strand of finger 3 corresponding to the region 5' 
(including the vector sequence) of and including tiie first two nucleotides of zi£268. The 
3' oligonucleotide primer, BZF3, having the nucleotide sequence 5'-GGC AAA CTT CCT 

1 5 CCC ACA AAT-3' (SEQUENCE ID NO. 7) hybridized to the coding strand of the finger 
3 beginning at nucleotide 216 and aiding at nucleotide 196. 

The PGR reaction was performed in a 100 microliter (ul) reaction containing one 
microgram (ug) of each of oligonucleotide primers FTX3 and BZF3, 200 millimolar 
(mM) dNTP's (dATP, dCTP, dGTP, dTTP), 1.5 mM MgClz Taq polymerase (5 units) 

20 (Peridn-Elmer Corp., Norwalk, CT), 10 nanograms (ng) of template pComb3.S zi£268, 
and 10 ul of lOX PGR buffer purchased conunercially (Perkin-Elma: Corp.). Thirty 
rounds of PGR amplification in a Peridn-Elmer Getus 9600 GeneAmp PGR System 
thomocycler were then performed. The amplification cycle consisted of denaturing at 
94°G for 30 seconds, annealing at SO'C for 30 seconds, followed by extension at 72°C 

25 for one minute. To obtain sufBcient quantities of amplification product, 30 identical PGR 
reactions were performed. 
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The resultant PCR amplification products were then gel purified on a 1.5% agarose gel 
using standard electroelution techniques as described in "Molecular Cloning: A 
Laboratory Manual", Sambrook, et al, eds.. Cold Spring Harbor, NY (1989). Briefly, 
after gel electrophoresis of the digested PCR amplified zinc finger domaro, the region of 
5 the gel containing die DNA fiagments of predetermined size was excised, electroeluted 
into a dialysis membrane, ethanol precipitated and resuspended in buffer containing 1 0 
mM Tris-HCl, pH 7.5 and 1 mM EDTA to a final concentration of SO ng/ml. 

The purified resultant PCR amplification products fix>m the first reaction were then used 
in an overly extension PCR reaction with the products of the second PCR reaction, botii 
10 as described below, to recombine tiie two products into reconstructed zi£268 containing 
randomized zinc fingers. 

The second PCR reaction resulted m the amplification of the 3' end of zi£268 finger 3 
overi£q)ping witii the above products and extending 3' of finger 3. To amplify this region 
for randomizing the encoded eight amino acid residue sequence of finger 3, the following 

15 primer pairs were used. The 5' coding oUgonucleotide primer pool was designated 
ZF36K and had tiie nucleotide sequraice represented by tiie formida, 5'-ATT TGT GGG 
AGG AAG TTT GCC NNK AGT NNK NNK NNK NNK NNK CAT ACC AAA ATC 
CAT TTA-3' (SEQUENCE ID NO. 8) (nucleotides 196-255). The 3' noncoding primer, 
R3B, hybridized to the coding strand at the 3' end of gene III (glll) having the sequence 

20 5'-TTG ATA TTC ACA AAC GAA TGG-3' (SEQUENCE ID NO. 9). The region 
between the two q)ecified ends of the primer pool is represented by a 15-mer NNK 
degeneracy. The second PCR reaction was performed on a second aliquot of pComb3.5 
template in a 100 ul reaction as described above containing 1 . ug of each of 
oligonucleotide primers as described. The resultant PCR products encoded a diverse 

25 population ofrandomizedzi068 finger 3 regions of 8 amino acid residues in length. The 
products were then gel purified as described above. 
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For the annealing reaction of the two PGR amplifications, 1 //g each of gel purified 
products fix)m the first and second PGR reactions were then admixed and fiised in the 
absence of primers for 35 cycles of PGR as described above. The resultant fusion product 
was then amplified wiA 1 ug each of FTX3 and R3B oligonucleotide primers as a primer 
5 pair in a final PGR reaction to form a complete zi£268 fiiagment by overlap extension. 
The overly PGR amplification was poformed as described for other PGR amplifications 
above. 

To obtain sufBcient quantities of amplification product, 30 idoitical overlap PGR 
reactions were poformed. The resulting fi:agments extended ftom 5' to 3' and had 

10 randomized finger 3 encoding 6 amino acid residues. The randomized zif268 
amplification products of approximately 450 base pairs (bp) in Iraigtfa in each of the 30 
reactions were first pooled and thai gel purified as described above and cut with ATioI and 
Spel, prior to their relegation into the pComb3.5 sur&ce display phagemid expression 
vector to form a library for subsequent screening against zinc finger recognition sequence 

15 oligos for selection of a specific zinc finger. The ligation procedure in creating 
expression vector libraries and tiie subsequent expression of the a£268 randomized 
pGomb3.5 clones was performed as described below in Example 4. 

Nucleotide substitutions may be performed on additional zinc fingers as well. For 
exan^le, in a£268, fingexs 1 and 2 may also be modified so that additional binding sites 

20 may be identified. For modification ofzincfingra: 2, primers FTX3 (as described above) 
and ZFNsi-B, 5*-CAT GCA TAT TCG ACA GTG GAA-3' (SEQUENGE ID NO. 10) 
(nucleotides 100-120) are used for tiie first PGR reaction, and R3B (desaibed above) and 
ZF2r6F (5'-CAG TGT CGA ATA TGG ATG GGT AAG TTG (NNK)^ AGG AGG GAG 
ATG GGG AGG CAG-S*) (SEQUENGE ID NO. 1 1) (nucleotides 103 to 168) are used for 

25 the second reaction. For modification of finger 1, RTX3 (above) and ZFI6rb (5'-GTG 
GGG TOT GTG GAT GGG GAT ATG (MNN), GGA MNN AGA AAA GGG GGG ATG 
GGA GGA-3') (SEQUENGE ID NO. 12) (nucleotides 28 to 93) are used for the first 
reaction and ZFIF (5'-GAT ATG GGG ATG GAG AGA GGG GAG-3') (SEQUENGE ID 
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NO. 13) (nucleotide 70 to 93) and R3B (above) are used in the second reaction. The 
overly reaction utilizes FTX3 and R3B as described above for finger 3. Preferably, each 
finger is modified individually and sequentially on one protein molecule, as opposed to 
all three in one reaction. The nucleotide modifications of finger 1 of zi£268 would 
5 include the underlined amino acids R S D E L T R H, (SEQUENCE ID NO. 14) winch 
is encoded by nucleotides 49 to 72. The nucleotide modifications of finger 2 of zif268 
would include S R S D H L (SEQUENCE ID NO. 1 5), vMch is encoded by nucleotides 
130 to 147. (See FIGURE 7). 

EXAMPLE 4 

10 PREPARATION OF PHAGEMTD-DISPLAYED SEQUENCES 

HAVING RANDOMIZED ZINC FINGERS 



The phagemid pComb3.S containing ziQ68 sequences is a phagemid expression vector 
that provides for the e^qnession of phage-displayed anchored proteins, as described above. 
The original pComb 3 expression vector was designed to allow for anchoring of expressed 

1 5 antibody proteins on the bacteriophage coat protein 3 for tiiie cloning of combinatorial Fab 
libraries. Xhol and Spel sites were provided for cloning complete PCR-amplified heavy 
chain ^d) sequences consisting of the region beginning with fiamewoik 1 and extraiding 
through fiamework 4. Gene m of filamraitous phage encodes this 406-residue minor 
phage coat protein, q>in (cp3), which is expressed prior to extrusion in the phage 

20 assembly process on a bacterial monbrane and accumiilates on the inner membrane facing 
into the periplasm ofR coll 

In this system, the first cistron encodes a periplasmic secretion signal (pelB leader) 
operatively linked to the fiision protein, zif268-cpIII. The presence of the pelB leader 
fecilitates the secretion of both tiie fusion protein containing randomized zinc finger ftom 
25 the bacterial cytoplasm into the periplasmic space. 
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By this process, the ziC68-qjin was dehvered to the periplasmic space by the pelB leader 
sequence, which was subsequently cleaved. The randomized zinc finger was anchored 
in the membrane by the cpIU membrane anchor domain. The phagemid vector, 
designated pComb3.S, allowed for sur&ce display of the zinc finger protein. The 
5 presence of the XhoVSpel sites allowed for the ins^on of XhoVSpel digests of the 
randomized zif268 PCR products in the pCcmb3.S vector. Thus, the ligation of the 
zi£268 mutagemzed nucleotide sequence prepared in Example 3 resulted in the in-fiame 
ligation of a complete zif268 fiagment consisting of PCR iamplified Gnger 3. The cloning 
sites in the pComb3.S expression vector were compatible with previously reported mouse 
10 and human PCR primers as described by Huse,c/ai:, Science. 246:1275-1281 (1989)and 
Peisson, et al.. Proc. Natl. Acad ScL. USA, 88:2432-2436 (1991). The nucleotide 
sequence of the pelB, a leader sequence for directing the expressed protein to the 
periplasmic space, was as reported by Huse, et al., supra 

The vector also contained a ribosome binding site as described by Shine, et al.. Nature, 
1 5 254:34. 1975). The sequence of the phagemid vector, pBluescript, vAuch includes ColEl 
and Fl origins and a betar-lactamase gene, has been previously described by Short, et al., 
Nuc. Acids Res.. 1^:7583-7600, (1988) and has the GenBank Accession Number 52330 
for tiie complete sequence. Additional restriction sites. Sail, Acc\, HincU, CM, HindHl, 
£i»RV, i'j/I and S>md, located between the ATioI and sites of the enqjty vector were 
20 derived fix>m a 51 base pair stu£fer ^agmeat of pBluescript as described by Short, et al.. 
supra. A nucleotide sequence that encodes a flexible 5 amino acid residue tether 
sequCTce v^ch lades an ordered secondary structure was juxt£^sed between the Fab aiid 
q)3 nucleotide domains so that interaction in the expressed fusion protein was minimized. 

Thus, the resultant combinatorial vector, pComb3.5, consisted of a DNA molecule having 
25 a cassette to exptess a fiision protein, zif268/cp3. The vector also contained nucleotide 
residue sequences for the following operatively linked elements listed in a 5' to 3' 
direction: the cassette consisting of LacZ promoter/operator sequences; a NoA restriction 
site; a ribosome binding site; a pelB leader, a spacer region; a cloning region bordered by 
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5' Xhol and 3' Spel restriction sites; the tether sequence; and the sequences encoding 
bacteriophage cp3 followed by a stop codon. A Nhel restriction site located between the 
original two cassettes (for heavy and light chains); a second lacZ promoter/operator 
sequence followed by an expression control ribosome binding site; a pelB leader, a spacer 
5 region; a cloning region bordered by 5' Sad and a 3' A6af restriction sites followed by 
e^qjression control stop sequences and a second Notl restriction site were deleted fiom 
pComb3 to form pComb 3.5. Those of skill in the art will know of sinular vectors that 
could be utilize in the metiiod of the invention, such as tiie Surf Zap™ vector (Stratagene, 
LaJolla,CA.). 

10 In the above esqnession vector, tiie zi£268/cp3 fusion protein is placed under the control 
of a lac promotei/operator sequence and directed to the periplasmic space by pelB leader 
sequences for functional assembly on the membrane. Inclusion of the phage F 1 intergenic 
region in the vector allowed for the packaging of single-stranded ph^emid with the aid 
of helpo* phage. The use of helper phage siq)erinfection allowed for the raqiression of two 

15 forms of cp3. Consequentiy, normal phage morphogenesis was pertuibed by competition 
between the Fd/cp3 fiision and the native cp3 of the helper phage for incorporation into 
the virion. The resulting packaged phagemid carried native cp3, w^iich is necessary for 
infection, and the encoded fiision protein, vMch is displayed for selection. Fusion with 
tiie C-terminal domain was necessitated by the phagenud :q)proach because fosion with 

20 the infective N-terminal domain would render the host cell resistant to infectioiL 



The pComb3 and 3.5 eiqxression vector described above forms the basic constiuct of the 
display phagemid expression vector used in this invention for the production of 
raiidomized zinc finger proteins. 
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EXAMPLE5 
PHAGEMID LfflRARY CONSTRUCTION 

In order to obtain expressed protein representing randomized anc fingers, phagemid 
libraries were constructed. The libraries provided for surfece expression of recombinant 
5 molecules where zinc fingers were randomized as described in Example 3. 

For preparation of phagemid libraries for expressing the PCR products prepared in 
Example 3, the PCR products were first digested with Xhol and Spel and separately 
ligated with a similarly digested original (i.e., not randomized) pComb3.S phagemid 
expression vector. The Xhol and Spel sites were present in the pComb3.5 vector as 

1 0 described above. The ligation resulted in operatively linking the zif268 to the vector, 
located 5' to Ae q>3 geac. Since the amplification products were inserted into the 
template pComb3.S expression vector &at originally had the heavy chain variable domain 
sequences, only the heavy diain domain cloning site was replaced leaving flie rest of the 
pComb3.5 e^qnession vector unchanged. Upon expression fiom the recombinant clones, 

15 the expressed proteins contained a randomized zinc finger. 

Phagemid libraries for expressing each of the randomized zinc fingers of this invention 
were prepared in the following procedure. To form circularized vectors containing the 
PCRproduct insert, 640 ng of the digested PCR products were admixed with 2 ug of the 
linearized pComb3.S phagemid vector and ligation was allowed to proceed overnight at 

20 room tranperature using 10 units of BRL ligase (Gai&ersburg, MD) in BRL ligase bufier 
in a reaction volume of ISO ul. Five sepaxabs ligation reactions were performed to 
increase the size of the phage library having randomized anc fingers. Following the 
ligation reactions, the circularized DNA was precipitated at -20''C for 2 hours by the 
admixture of 2 ul of 20 mg/ml glycogen, 15 ul of 3 M sodium acetate at pH 5.2 and 300 

25 ul of edianol. DNA was then pelleted by miraocentrifiigation at 4»C for 15 minutes. The 
DNA pellet was washed with cold 70% ethanol and dried under vacuum. The pellet was 
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resuspended in 10 ul of water and transformed by electroporation into 300 ul of E. coli 
XL 1 -Blue cells to form a phage library. 

After transfoimation, to isolate phage expressing mutagenized finger 3, phage were 
induced as described below for subsequent panning on a hairpin oligo having the 
5 following sequence (SEQUENCE ID NO. 16): 

NHj-CGT-AAA- TGG - GCG -CCC - T 

T 
T 

10 GCA-TTT-ACC-CGC-GGG - T 

The bold sequence indicates the new zinc finger 3 binding site (formerly GCG), the 
underlined sequence refnesents the finger 2 site and Ae double underlining represents the 
finger 1 binding site. 

15 Transformed K coli were grown in 3 ml of SOC medium (SOC was prepared by 
admixture of 20 grams (g) bacto-tryptone, S g yeast extract and O.S g NaCI in 1 liter of 
water, adjusting the pH to 7.S and admixing 20 ml of glucose just before use to induce the 
esqnession of the a£268-qpIII), were admixed and the culture was shaken at 220 rpm for 
1 hour at 37''C. Following this incubation, 10 ml of SB (SB was jaepaied by admixing 

20 30 g tiyptone, 20 g yeast extract, and 10 g Mops buffer per liter with pH a^usted to 7) 
containing .20 ug/ml carbenicillin and 10 ug/ml tetracycline were admixed and the 
admixture was shaken at 300 ipm for an additional hour. This resultant admixture was 
admixed to 100 ml SB containing SO ug/ml carbenicillin and 10 ug/ml tetracycline and 
shaken for 1 hour, after vMch helper phage VCSM13 (10" pfii) were admixed and the 

25 admixture was shaken for an additional 2 hours at 37''C. After ^s time, 70 ug/ml 
kanamycin was admixed and maintained at 30''C overnight The lower temperature 
resulted in better expression of zii268 on the surface of the phage. The supernatant was 
cleared by centrifugation (4000 rpm for IS minutes in a JAIO rotor at 4''C). Phage were 
precipitated by admbcture of 4% (w/v) polyethylene glycol 8000 and 3% (w/v) NaCl and 

30 maintained on ice for 30 mintites, followed by centrifiigation (9000 rpm for 20 minutes 
in a JAIO rotor at 4°C). Phage pellets were resuspended in 2 ml of buffer (SmM DTT, 
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lOmMTiis-HCl, pH 7.56, 90niM KCl, 90mM ZnClj, ImM MgClj and microcentrifuged 
for three minutes to pellet debris, transferred to fresh tubes and stored at -lO'C for 
subsequent screening as described below. DTT was added for refolding of the 
polypeptide on the phage surface. 

5 For determining the titering colony forming units (cfa), phage (packaged phagemid) were 
diluted in SB and 1 ul was used to infect SO ul of fresh (A<Hxoo = 1) £>.£Q!i XLl-Blue cells 
grown in SB containing 10 ug/ml tetracycline. Phage and cells were maintained at room 
temperature for IS minutes and then directly plated on LB/caibenidllin plates. The 
randomized zinc finger 3 library consisted of S x 10^ PFU total. 

10 Multiple Paimingg of the Phage Library 

The phage library was panned against fbe hairpin oligo containing an alt«?ed binding site, 
as described above, on coated microtiter plates to select for novel zinc fingers. 

The pamiing procedure used, comprised of several rounds of recognition and replication, 
was a modification of that originally described by Parmley and Smith (Parmley, et al., 

15 Gene, 2i:30S-318, 1988; Barbas, et al.. 1991, stqira.). Five rounds of panning were 
performed to enrich for sequence-specific binding clones. Fot this procedure, four wells 
of a microtiter plate (Costar 3690) were coated by drying ovonight at 37°C with l//g Ae 
oligo or the oligo was covalently attached to BSA witii EEKVNHS activation to coat Has 
plate (360 acetylated BSA (Boehringer Manheim), S77 Mg oligo, 40mM NHS, and 

20 lOOmM EDC were combined in 1 .8 ml total volume and incubated overnight at room 
temperature. The plates were coated using SO^d per plate and incubated at 4''C overnight 
The wells were washed twice with water and blocked by conq>letely filling the well with 
3% (wAr) BSA in PBS and maintaining &e plate at 37''C for one hour. After the blocking 
solution was shaken out, SO til of the phage suspension prepared above (typically 10'^ pfu) 

25 were admixed to each well, and the plate was maintained for 2 hoturs at 37''C. 
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Phage were removed and the plate was washed once with water. Each well was then 
washed 10 times with TBS/Tween (50 mM Tris-HCl at pH 7.5, 150 mM NaCl, 0.5% 
Tween 20) over a period of 1 hour at room temperature ^ere the washing consisted of 
pipetting up and down to wash the well, each time allowing the well to remain completely 
5 filled with TBS/Tween between washings. The plate was washed once more with 
distilled water and adherent phage were eluted by the addition of SO ul of elution buffer 
(0. 1 M HCl, adjusted to pH 22 with solid glycine, containing 1 mg/ml BSA) to each well 
followed by maintenance at room temperature for 10 minutes. The elution buffer was 
pipetted xnp and down several times, removed, and neutralized with 3 ul of 2 M Tris base 
10 per 50 ul of elution buffer used. 

Eluted phage were used to infect 2 ml of fresh (OD«oo =\)E. coli XLl-Blue cells for 15 
minutes at room temperature, after ^^ch time 10 ml of SB containing 20 ug^ 
carbenicillin and 10 ug/ml tetracycline was admixed. Aliquots of 20, 10, and 1/10 ul were 
removed from the culture for plating to determine the number of phage (packaged 

1 5 phagemids) that were eluted from the plate. The culture was shaken for 1 hour at 37''C, 
after w4iidi it was added to 100 ml ofSB containing SO ug^ carbenicillin and lOug/ml 
tetracycline and shaken for 1 hour. Helper phage VCSM13 (10" pfii) were then added 
and the culture was shaken for an additional 2 hours. After this time, 70 ug^kanamydn 
was added and Ae culture was incul)ated at 37°C overnight Phage preparation and 

20 further panning were repeated as desmbed above. 

Following each round of panning, the percentage yield of phage were determined, wh«ce 
% yield = (number of phage eluted/number of phage applied) X 100. The initial phage 
input ratio was determined by titering on selective plates to be ^proximately 10" cfii for 
each roimd of panning. The final phage output ratio was determined by infecting two ml 
25 of logarithmic phase XLl-Blue cells as desoibed above and plating aliquots on selective 
plates. From this procedure, clones were selected from tiie Fab library for their ability to 
bind to the new binding sequence oligo. The selected clones had randomized zinc finger 
3 domains. 
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The results from sequential panning of the randomized zinc finger 3 library revealed five 
binding sequences which recognized the new finger 3 site. The native site, GCG, was 
altered to AAA and the following sequences shown in Table 1 were identified to bind 
AAA. 
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TABLEl 
BINDING SEQUENCE 



10 



SEQUENCE ID NO. 17 


RSDERKRH' 


SEQUENCE ID NO. 18 


WSI PVL LH 


SEQUENCE ID NO. 19 


WSLLPVLH 


SEQUENCE ID NO. 20 


FSFLLPLH 


SEQUENCE ID NO. 21 


LSTWRGWH 


SEQUENCE ID NO. 22 


TSIQLPYH 



' RSD ERK RH is the native Finger 3 binding sequoice. 



EXAMPLE 6 

COTRANSFORMATION ASSAY FOR IDENTIFICATION 
OF ZINC FINGER ACTIVATION OF PROMOTER 



15 In order to assess the fiinctional properties of the new zinc fingers generated, an K coli 
based in vivo system has been devised. This system utilizes two plasmids with the 
compatible replicons colEl and plS. Cytosplamic expression of tiie zinc finger is 
provided by the arabinase promoter in the colEl plasmid. The pi 5 replica containing 
plasmid contains a zinc finger binding site in place of the repressor binding site in a 

20 plasmid v^chejqnesses the a fragment of p galactosidase. The binding ofthe zinc finger 
to this site on the second plasmid shuts-ofT the production of p galactosidase and thus 
novel zinc fingers can be assessed in this in vivo assay for function using a convenient 
blueAitliite selection. For example, in tiie presence of arabinose and lactose, the zinc 
finger gene is e^qiressed, the protein product binds to the zinc finger binding site and 

25 represses the lactose promoter. Therefore, no p-galactosidase is produced and white 
plaques would be present This system which is compatible with respect to restriction 
sites vnth pComb3.5, will facilitate the rapid characterization of novel fingers. 
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Furthennore, this approach could be extended to allow for the genetic selection of novel 
transcriptional regulators. 

Another method of mutagenizing a wild type zinc finger-nucleotide binding protein 
includes segmental shufQing using a PCR techniqiie which allows for the shufQing of 
5 gene segments between collections of genes. Preferably, the genes contain limited 
regions of homology, and at least 15 base pairs of contiguous sequence identity. 
Collections of zinc finger genes in the vector pComb3.5 are used as templates for the PCR 
technique. Four cycles of PCR are poformed by denaturation, for example, for 1 min at 
94'C and annealling of 50"C for 15 seconds. In separate experiments PCR is performed 

1 0 at 94'C, 1 min, 50»C, 30 sec; 94», 1 min, 50% 1 min; 94% 1 min, 50% 15 sec, 72% 1 sec. 
All Kcperiments use the same template (a 1 Ong mixture). The e:q}eriment is performed 
sudi that under each condition two sets of reactions are performed. Each set has only a 
top or a bottom strand primer, which leads to the generation of single-stranded DNA's of 
diflferent lengflis. For example, FTX3, ZFIF and FZF3 primers may be used in a separate 

15 set to give single stranded products. The products from these reactions are then pooled 
and additional 5' and 3' terminal primers (e.g., FTX3 and R3B) are added and the mbc is 
subjected to 35 additional rounds of PCR at 94''C, 1 min, 50% 15 sec, 72% 1 min 30 sec. 
The resultant mixture may ihen be cloned by Xho I/Spe I digestion. The new shufQed 
zinc fingers can be selected as described above, by panning a display of zinc fingers on 

20 any genetic package for selection of the qnimal zinc-finger collections. This technique 
may be applied to any collection of genes "widdx contain at least 15 bp of contiguous 
sequoice identity. Primers may also be doped to a defined extent as described above 
using the NNK example, to introduce mutations in primer binding regions. Reaction 
times may be varied depending on length of template and number of primers used. 
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EXAMPLE7 
MODIFICATION OF SPECIFICITY OF ZiC68 

Reagents. Strains, and Vectors 

Restriction endonucleases were obtained fiom New England Biolabs or Boehringer 
5 Mannheim. T4 DNA ligase was the product of GEBCO BRL. Taq polymerase and Vent 
polymerase was purchased from Promega Heparin-Sepharose CL-6B medium was from 
Pharmacia. Oligonucleotides were fsom Operon Technologies (Alameda, CA), or 
prepared on a Gene Assembler Plus (Pharmacia LKB) in the laboratory. pZif89 was a gift 
fiom Drs. Pavletich and Pabo (Pavletich, Science, 252:809-817, 1991). Escherichia coli 
1 0 BL21 (DE3)pLysS and plasmid pET3a was from Novagen, Escherichia coli XLl-Blue, 
phage VCSM13, the phagonid vector pComb3, and pAraHA are as desoibed (Baibas m, 
et al, Proc. Natl. Acad. ScL USA, M:7978-7982, 1991; Barbas HI, et al.. Methods: A 
Companion to Methods in Enzymology, 2:1 19-124, 1991). 

Plasmid Constmction 

15 Genes encoding wild-type zinc-finger proteins were placed under the control of the 
Salmonella typhimtirium ardQ promoter by insertion of a DNA fragment amplified by the 
polymerase chain reaction (PCR) and containmg the wild-type ZiQ68 gene of pzif89 
(Pavletich, stg>ra) with the addition of mu'rJple restriction sites (XhoVSacU and 
XbaVSpel). The resulting plasmid vector was subsequently used for subcloning the 

20 selected zinc-finger genes for immunoscreening. In this vector the ancfingo- protein is 
expressed as a fiision with a hemagglutinin decapeptide tag at its C-terminus which may 
be detected widi an anti-dec{4)eptide monoclonal antibody (FIGURE 8A) (Field, et al.. 
Mol. & Cell. Biol S:2159-2165, 1988). The Zi£268 protein is aligned to show the 
conserved features of each zinc finger. The a-heUces and antiparallel p-sheets are 

25 indicated. Six amino-acid residues underlined in each finger sequence were randomized 
in library constructions. The C-terminal end of Zi£268 protein was fiised with a fragment 
containing a decapeptide tag. The position of fusion is indicated by an arrow. 
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The phagemid pComb3 was modified by digestion with Nhel and Xbal to remove the 
antibody light chain fragment, filled with Klenow fiiagment, and the backbone was self- 
ligated, yielding plasmid pComb3.S. The Zif268 PCR fi-agment was inserted into 
pComb3.S as above. To eliminate background problems in library construction a 1.1-kb 
5 nonfunctional staffer was substituted for die wild-type Zi£268 gene using Sad and XbcA.. 
The resulting plasmid was digested by Sad and Xbal to excise the stuffer and the 
pComb3.S backbone was gel-purified and served as the vector for library construction. 

Zine Finger Libraries 

Three zinc-fingra- libraries were constructed by PCR overUq) extension using conditions 

1 0 iH^ously described in Example 3. Briefly, for finger 1 library primer pairs A (S'-GTC 
CAT AAG ATT AGC GGA TCC-3') (SEQ. ID NO:29) and Zfl6rb (SEQ. ID N0:I2); 
(where N is A, T, G, or C, and M is A or C), and B (5'-GTG AGC GAG GAA GCG GAA 
GAG-3') (SEQ. ID NO:30) and Zflf (SEQ. ID NO: 13) were used to amplify fragments 
of Zi£268 grae using plasmid pAra-Z{y255 as a template. Two PCR fisgments were 

1 5 mixed at equal molar ratio and the mixture was used as templates for overlap extension. 
The recombinant fragments were then PCR-amplified using primers A and B, and the 
resulting product was digested with Sad and Xbal and gel purified. For each ligation 
reaction, 280 ng of digested fragment was ligated wi& 1 .8 /^g of pComb3.S vector at room 
temperature overnight Twelve reactions were performed, and the DNA was ethanol- 

20 precipitated and electroporated into K coli XL 1 -Blue. The libraries of finger 2 and 3 
were constructed in a similar manna* except that the PCR primers Zfl 6rb and ZF 1 F used 
in finger 1 library construction were rqilaced by Zfiisi-B (SEQ. ID NO:10) and ZF2r6F 
(SEQ. ID NO: 1 1) (where K is G or T) for finger 2 Ubrary, and by BZF3 (SEQ. ID NO:7) 
and ZF36K (SEQ. ID N0:8) for finger 3 library. In the libraries, six amino-acid residues 

25 corresponding to the a-helix positions -1, 2, 3, 4, S, 6 of finger 1 and 3, positions -2, -1, 
1, 2, 3, 4 of finger 2 were randomized (FIGURE 8A). 
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/fi Vitro Sele ction of Zinc Finpers 

A 34-nucleotide haiipin DNA containing either consensus or altered Zif268 binding site 
was used for zinc-finger selection (FIGURE 8). The consensus binding site is denoted 
as Z268N (5'-CCT QQQ TQQ Q£Q CCC nTT GGG QQQ CCA CCC AGG-S") (SEQ. 
5 ID NO: 3 1). The altered site for finger 1 is TGT (5'-CCT GCG TGG IQI CCC TTTT 
GGG ACA CAA CGC AGG-3') for finger 2 is TTG (5'-CCT GCG HQ GCG CCC 
TTTT GGG CGC CAA CGC AGG-3') and lor finger 3 is CTG (5'-CCT dQ TGG GCG 
CCC TTTT GGG CGC CCA CAQAGG-S"). The oligonucleotide was synthesized with 
a primary n-hexyl amino group at its S' end. A DNA-BSA conjugate was prepared by 

1 0 mixing 30 mM DNA with 3 //M acetylated BSA in a solution containing 100 mM l-(3- 
dimetbylaminopropyl)-3-ethylcarbodiimide hydrochloride (EDCI) and 40 mM N- 
hydroxysuccinimide (NHS) as room tonperature for S^urs or ovonight Zi£268 phage, 
10" colony forming units, in 50 /zl zinc buffer (lOmM Tris-Q, pH 7.5, 90 mM KCl, 1 
mM MgClz, 90 uM ZnClj, ImM MgClj and 5 mM DTT) containing 1% BSA was qjplied 

15 to a microtiter well precoated with 4.9 Mg of DNA-BSA conjugate in 25 ^1 PBS buffer (10 
mM potassium phosphate, 1 60 mM NaCl, pH 7.4) per well. After 2 hours of incubation 
at 37»C, the phage was rranoved and the plate washed once by TBS buffer (50 mM Tris- 
Cl and 150 mM NaCI, pH 7.5) containing 0.5% Tween for the first round of selection. 
The plate was washed 5 times for round 2, and 1 0 times for further rounds. Bound phage 

20 was extracted with eludon buffer (0. 1 M HCl, pH 2.2 (adjusted with glycine), and 1% 
BSA), and used in infect K coli XL 1-Blue cells to produce phage for the subsequent 
selection. 

Immunoscreening 

Mutant zinc fingo- genes selected a&ex five or six rounds of panning were subcloned into 
25 the pAraHA vector using JOtol and Spel restriction sites. Typically, 20 clones were 
screened at a time. Cells were grown at 37°C to late-log phase (OD«x)0.8-l) in Ae 6 ml 
SB media (Barbas III, et al, supra) containing 30 Mg/ml chloramphenicol. Expression 
of zinc-finger proteins was induced with addition of 1% of arabinose. Cells were 
harvested 3 to 12 hours following induction. Cell pellets were resuspended in 600 lA anc 
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bufier containing 0.5 mM phenylmethylsulfonyl fluoride (PMSF). Cells were lysed with 
6-fieeze-thaw cycles and the supernatant was clarified by centrifugation at 12,000g for 
5 minutes. A 50 /il-aliquot of cell supernatant was applied to a microtiter well precoated 
with 1.1 /ig of DNA-BSA conjugate. After 1 hour at 37"C, the plate was washed 10 times 
5 with distilled water, and an alkaline phosphatase conjugated anti-decapeptide antibody 
was added to the plate. After 30 minutes at 37''C, the plate was washed 10 times and p- 
nitrophenylphosphate was added. The plate was ibsn monitored with a microplate 
autoreader at 405 nm. 

Overexpression and Pnrification of Zinc-Finger Proteins 

1 0 Zinc finger proteins were overproduced by using the pET expression system (Studier, et 
ai. Methods Enzymol, 135:60-89, 1990). the ZiQ68 gene was introduced following 
PCR into M/el and fOmHI digested vector pET3a. Subsequently, the Zi£268 gene was 
replaced with a 680-bp nonfimctional stuffer firagment The resulting pET plasmid 
containing the stuffer fiagment was used for cloning other anc-finger genes by replacing 

15 the stuffer with zinc-finger genes using iS^I and A^I sites. The pET plasmids encoding 
zinc-finger genes y/ett introduced into BL21(DE3)pLysS by chemical transformation. 
Cells were grown to mid-log phase (ODeooO.4-0.6) in SB medium containing 50 Axg/ml 
caibaiicillin and 30 ^^g^ diloranq)henicol. Protein eiqjression was induced by addition 
of 0.7 mM IPTG to tiie mediuuL Typically, 500-ml cultures were harvested fliree hours 

20 after induction. Cell pellets were resuspoided in the zinc buffer containing 1 mM PMSF 
and cell were lysed by sonication for 5 minutes at Q'*C. Following addition of 6mM' 
MgClj, cell lysate were incubated wi& 10 /«g/ml DNase I for 20 minutes on ice. Inclusion 
bodies containing anc finger protein were collected by centiifiigation at 25,000g for 30 
mi mitftg and woe resuspended and solubilized in 10 ml Zinc buffer containing 6M urea 

25 and 0.5 mM PMSF with gentle mixing foi 3 to 12 hours at 4"C. The extract was clarified 
by centrifugation at 30,000g for 30 mmutes and filtered through a 0.2-Mni low protein 
bindmg filter. Total protein extract was sailed to a Heparin-Sepharose FPLC column 
(1.6 X 4.5 cm) equilibrated wifli zinc buffer. Proteins were eluted with a 0-0.7 M NaCl 
gradient. Fractions containing zinc-finger protein were identified by SDS-PAGE and 
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pooled. Protein concentration was determined by the Bradford method using BSA 
(fraction V) as a standard (Bradford, Anal. Biochem, 22:248-254, 1976). The yield of 
purified protein was from 7 to 19 mg/liter of cell culture. Protein was over 90% 
homogeneotis as judged by SDS-PAGE. 

5 Kinetic Analysis 

The kinetic constants for the interactions between ZiQ68 pqitides and their DNA targets 
were determined by sur&ce plasmon resonance based analysis using the BIAcore 
instrument (Pharmacia) (Malmqvist, Ctar. Opinion in /mmuno., 1:282-286, 1993). The 
sur&ce of a sensor chip was activated with a mixture of EDCI and NHS for 1 S minutes. 

10 Thm 40 MlofafBnity purified streptavidin (Pierce), 200 Mg/ml in I OmM sodium acetate 
(pH4.5), was injected at a rate of 5 /A/naa. Typically, 5000-6000 resonance units of 
streptavidin were immobilized on the diip. Excess ester groups were qumched with 30 
/il of IM ethanolamine. Oligonucleotides were immobilized onto the chip by injection 
of 40 fA of biotinylated oUgonucleotides (50 ngfrcl) in 0.3 M of sodium chloride. Usually 

15 1500-3000 resonance units ofoUgomers were immobilized. The association rate (AgJ was 
determined by studying the rate of binding of the protein to Hbe surface at 5 different 
protein concoitiations ranging finm 10 to 200 ng/nl in the zinc buffer. The dissociation 
rate (k^ was determmed by increasing flow rate to 20 /<l/min after association phase. 
Hie value is die average of three measurements. The and Ag^ value were calculated 

20 using Biacore® kinetics evaluation software. The equilibrium dissociation constants were 
deduced from the rate constants. 

EXAMPLE 9 

PHAGEMID DI SPLAY OF MODIFIED ZINC FINGERS 

Library Desifn and Selection 
25 Phage display of the Zif268 protein was achieved by modification of the phagemid 
display system pComb3 as described m Examples 2-6. The Zif268 sequence from pzif89 
was tailored by PCR for insertion between the Xhol and Spel sites of pComb3.5. As 
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described above in Bcample 4, insertion at these sites results in the fusion of ZiG68 with 
the carboxyl terminal segment of the filamentous phage coat protein HI, pIII, gene. A 
single panning e:q)eriment v^ch consists of incubating the phage displaying the zinc 
finger protein with the tai^et DNA sequence immobilized on a microtiter well followed 
5 by washing, elufion, and titering of eluted phage was utilized to examine the fimctional 
properties of the protein display ed on the phage sur&ce. 

In control oq)eriments, phage displaying Zi£268 were examined in a panning experiment 
to bind a target sequence bearing its consensus binding site or tiie binding site of the first 
three fingers of TFIIIA, These experiments showed that Zif268 displaying phage bound 
10 the q)propriate target DNA sequence 9-fold over the TFfflA sequence or BSA and 
demonstrated tiiat sequence specific binding of the finger complex is maintained during 

lAage display. A 4-fold reduction in phage bindnig was noted vihen Zn** and DTT were 
not included in the binding buffer. Two r^rts verify that ZiQ68 can be displayed on 
Ae phage surfece (Rebar, et d., Science, 262:671-673, 1994; Jamieson, et al, Biochem., 
15 32:5689-5695, 1994). 

In a siinilar ejqjeriment, Ae first three fingers of TFIIIA were di^layed on tiie surfece of 
phage and also shown to retam specific binding activity. Immobilization of DNA was 
fecilitated by the design of stable hairpin sequence whidi present the di^lex DNA target 
of Ae fingers within a single oligonucleotide which was amino labeled (FIGURE 8B) 
20 (Antao, et al.. Nucleic Acids Research, 12:5901-5905. 1991). The hairpin DNA 
containing the 9-bp consensus binding site (5'-GCGTGGGCG-3', as enclosed) of wild- 
type Zif268 was used for afBnity selection of phage-di^layed zinc finger proteins. In 
addition, &e 3-bp subsites (boxed) of consensus fflV-l DNA sequence were substituted 
for wild-type Zi£268 3-bp subsites for affinity selection. 

25 The amino linker allowed for covalent coupling of the haiipin sequence to acetylated BSA 
which was then immobilized for selection expmments by adsorption to polystyrene 
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microtiter wells. Biotinylated hairpin sequences woiked equally well for selection 
following immobilization to streptavidin coated plate. 

Libraries of each of the three fingers of Zi£268 were independently constructed using the 
previously described overlap PCR mutagenesis strategy (Barbas ni, et al. Proc. Natl. 
5 Acad. Sci. USA, 22:4457-4461, 1992 and EXAMPLES 2-6). Randomization was limited 
to six positions due to constraints in the size of libraries which can be routinely 
constnicted (Barbas m, Curr. Opinion in Biotech, 4:526-530, 1993). Zinc finger protein 
recognition of DNA involves an antiparallel arrangement of protein in the major groove 
of DNA, i.e., the amino temunal region in involved in 3' contacts witii the target sequence 

1 0 i^ereas the carboxyl tominal region is involved in 5' contacts (FIGURE 8B). Within a 
given finger/DNA subsite complex, contacts remain antiparallel i^ere in Qn%a 1 of 
ZiQ68, guanidiniimi groiqs of Arg at helix positions -I and 6 hydrogen bond with the 3' 
and 5' guanines, respectively of the GCG target sequence. Contact with the central base 
in a triplet subsite sequence by the side chain of the helix position 3 residue is observed 

15 in finger 2 of Zi£268, fingers 4 and 5 of GLI, and fingers 1 and 2 of TTK. Within tiie 
three reported crystal structures of zinc-finger/DNA complexes direct base contact has 
been observed betweoi die side-chmns of residues -1 to 6 with tiie exception of 4 
(Pavletich, sipra; Pavletich, Science, 261:1701-1707, 1993; Fairall, et al.. Nature, 
2^:483-487, 1993). 

20 Based on these observations, residues corresponding to the helix positions -1, 2, 3, 4, 5, 
and 6 were randomized in the finger 1 and 3 libraries. The Ser of position 1 was 
conserved in these experiments since it is well conserved at this position in zinc finger 
sequences in general and completely conserved in Zi£268 (Jacobs, EMBOJ., 11:4507- 
45 1 7, 1 992). In the finger 2 library, helix positions -2, -1 , 1 , 2, 3, and 4 were randomized 

25 to explore a different mutagenesis strategy where the -2 position is examined since both 
Zi£268 and GLI structures reveal tiiis position to be involved in phosphate contacts and 
since it will have a context effect on the rest of the domain. Residues 5 and 6 were fixed 
since the target sequence TTG retained the 5' thymidine of the wild type TGG site. 
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Introduction of ligated DNA by electroporation resulted in the construction of libraries 
consisting of 2 x 10', 6 x 10', and 7x10' independent transformants for finger libraries 

1. 2, and 3, respectively. Each library results in the display of the mutagenized finger in 
the context of the two remaining fingers of wild-type sequences. 

5 EXAMPLE 10 

SEQUENCE ANALYSIS OF SELECTED FINGERS 

In order to examine fte potential of modifying zinc-fingors to bind defined targets and to 
examine their potential in g&as idaexapy, a conserved sequence within the HIV-1 genome 
was chosen as a target sequence. The S' leader sequence of HTV-l HXB2 clone at 
1 0 positions 1 06 to 121 relative to the transcriptional iiutiation start site rqjresCTts one of 
several conserved regions witiiin HIV-1 goiomes (Yu, et aL, Proc. Natl Acad. Sci. USA, 
22:6340-6344, 1993); Myers, et al.. 1992). For these expeiimraits, the 9 base pair region, 
113 to 121, shown in FIGURE 8B, was targeted. 

Following selection for binding the native consoisus or HIV-1 target sequences, 
15 functional zinc fingers were rapidly id^tified with an immunoscreening assay. 
E}q)ression of the selected proteins in a pAraHA derivative resulted in the fiision of the 
mutant ZiQ68 proteins witii a peptide tag sequence recognized by a monoclonal antibody 
(FIGURE 8A). Binding was determined in an ELISA format using crude cell lysates. A 
qualitative assessmrait of ^ecificity can also be achieved with this methodology \N4uch 
20 is sensitive to at least 4-foId difTerences in afBnity. Several positive clones fix}m each 
selection were sequenced and are shown in FIGURE 9. The six randomized residues of 
finger 1 and 3 are at positions -1, 2, 3, 4, S, and 6 in the a-helical region, and at -2, -1, 1, 

2. 3, and 4 in finger 2 (FIGURE 9). The three nucleotides denote the binding site used 
for a£Bnity selection of each finger. Proteins studied in detail are indicated with a clone 

25 designation. 
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Finger 1 selection with the consensus binding site GCG revealed a strong selection for 
Lys at position -1 and Arg at position 6. Covariation between positions -1 and 2 is 
observed in three clones ^^ch contain Lys and Cys at these positions respectively. Clone 
C7 was preferentially enriched in the selection based on its occurrence in 3 of the 12 
5 clones sequenced. Selection against the HTV-l target sequence in this region, TGT, 
revealed a diversity of sequraices with a selection for residues with hydrogen-bonding side 
chains in position -1 and a modest selection for Gin at position 3. Finger 2 selection 
against the consensus TGG subsite showed a selection for an aromatic residue at -1 
whereas selection against the HTV-l target TTG demonstrated a selection for a basic 

10 residue at this position. The preference for Ser at position 3 may be relevant in the 
recognition of thymidine. Contact ofthymine with Ser has been observed in the GLI and 
TTK. structures (Pavietich, svpra; Faiiall, et aL, siprd). Other modest selections towards 
consensus residues can be observed within the table. Selections were performed utilizing 
a supE strain of K coli which resulted in the reading of the amber codon TAG as a Gin 

1 5 during translation. Of the 5 1 sequences presented in FIGURE 9, 1 4 clones possessed a 
single amber codoiL No clones possessed more than one amber codon. Selection for 
svfipression of the amber stop codon in sap'E stains has been noted in othor DNA bindmg 
protein libraries and likely improves the quality of the library since this residue is 
fiequ»itly used as a contact residue in DNA binding proteins (Huang, et al. Proc. Natl. 

20 Acad ScL USA, 21:3969-3973, 1994). Selection for fingers containmg fi«e cysteines is 
also noted and likely reflects the experimental protocol. Phage were incubated in a buffer 
containing Zji*^ and DTT to maximize the number of phage bearing properly folded 
fingers. Selection against fiee cysteines, presumably due to aggregation or improper 
folding, has been noted previously in phage display libraries of other proteins (Lowman, 

25 «r a/.. y.M)/. 5/0/., 224:564-578, 1993). 

For further characterization, high level expression of zinc finger proteins was achieved 
using the T7 promoter (FIGURE 10) (Studier, et al., supra). In FIGURE 10, proteins 
were separated by 1 5% SDS-PAGE and stamed with Coomassie brilliant blue. Lane 1 : 
molecular weight standards (kDa). Lane 2: cell extract before IPTG induction. Lane 3: 
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cell »ctract after IPTG induction. Lane 4: cytoplasmic fiaction aftra* ranoval of inclusion 
bodies by centriiugation. Lane 5: inclusion bodies containing nnc finger peptide. Lane 
6: mutant Zif268 peptide purified by Heparin-Sqiharose FPLC. Clones CI 0, F8, and G3 
each possessed an amber codon vAdch was converted to CAG to encode for Gin prior to 
5 expression in this system. 

EXAMPLE 11 

CHARACTERIZATION OF AFFINITY AND SPECIFiaTY 

In order to gain insight into the mechanism of altered specificity or afiOnity, the kinetics 
of binding was determined using real-time changes in surface plasmon resonance (SPR) 

10 (Malmqvist, fupra). The kinetic constants and calculated equilibrium dissociation 
constants of 11 proteins are shown in FIGURE II. Each zinc finger protein studied is 
indicated by a clone designation (for its sequence, see FIGURE 9). The target DNA site 
used for selection of eadi finger is indicated in bold fiice. The consoisus bmding site for 
the wild type protein is also shown in bold. The non-hairpin di^lex DNA (underlined) 

15 was prepared by annealing two single-stranded DNAs. The A^a, association rate; 
dissociation rate; K^, equilibrium dissociation constant for each protein is given. 

The calculated equilibrium dissociation constants for Zi068 binding to its consensus 
sequence in the form of tiie designed hairpin or a linear duplex lacking the tetrathymidine 
loop are virtually identical suggesting tiiat the conformation of the duplex sequence 
20 recognized by tiie protein is not perturbed in conformation within the hairpin. The value 
of 6.5 nM for Zi£268 binding to its consensus is in the range of 0.5 to 6 nM reported using 
electrophoretic mobility shift assays for this protein binding to its consensus sequence 
within oligonucleotides of different length and sequence (Pavletich, siqirtr, Rebar, 5upra; 
Jamieson, et al, supra). 

25 As a measure of specificity, the afBnity of each protein was determined for binding to the 
native consensus sequence and a mutant sequence in which one finger subsite had been 
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changed. FIGURE 1 1 shows the detennination of dissociation rate (k^) of wild-Qrpe 
ZiC268 protein (WT) and its variant C7 by real-time changes in surface plasmon 
resonance. The response of the instnunent, r, is proportional to [protein-DNA] complex. 
Since dr/dt = vAien [protein] = 0, then Aofr = In (^t AnyCVti). >^ere r„ is the response 
5 at time t„. The results of a single e:qpaiment 'for each protein are shown. Three 
ejqjeriments were performed to produce the values shown in FIGURE 1 1 . Clone C7 is 
improved 13-fold in afGnity for binding the wild-type sequence GCG. The major 
contribution to fins improvonent in afBnity is a S-fold slowing of the dissociation rate of 
the complex (FIGURE 12). Specificity of the C7 protein is also improved 9-fold witii 

1 0 respect to the HTV-l target seqiience. This result suggest that additional or improved 
contacts are made in the complex. Studies of protein C9 demonstrate a different 
mechanism of improved specificity. In this case the ovraall afGnity of C9 for the GCG 
site is equivalent to S£268 but the specificity is improved 3-fold over Zi£268 for binding 
to the TGT target site by an increase in the off-rate of diis complex. Characterization of 

1 5 proteins F8 and Fl 5 demonstrate that the 3 base pair recognition subsite of finger 1 can 
be completely changed to TGT and that new fingers can be selected to bind this site. 

CharacteriTHtion of proteins modified in the fingo* 2 domain and selected to bind die TTG 
subsite reveal ibe sped&dty of this finger is amenable to modification. Proteins G4 and 
G6 bind an oligonucleotide bearing the new subsite with afSnities equivalent to Zif268 

20 binding its consensus target Specificity of these proteins for the target on which Aey 
were selected to bind is demonstrated by an approximately 4-fold better afGnity for this 
oligonucleotide as comiMued to tiie native binding site ^^ch differs by a single base pair. 
Hiis level of discrimination is similar to that reported for a finger 1 mutant (Jamieson, et 
al, supra). The finger 3 modified protein A14 was selected to bind the native finger 3 

25 subsite and binds diis site with an afGnity which is only 2-fold lower than Zi£268. Note 
that protein A14 differs radically in sequence firom the native protein in the recognition 
subsite. Sequence specificity in 10 of the 1 1 proteins characterized was provided by 
differences in the stability of the complex. Only a single protein, G6, achieved specificity 
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by a dramatic change in on-rate. Examination of on-rate variation with charge variation 
of the protein did not reveal a correlation. 

niMFRTC ZINC FINCER CONSTRUCTION 

Zinc finger proteins of the invention can be manipulated to recognize and bind to 
extended target sequmces. For example, zinc finger proteins containing fit>m about 2 to 
12 zinc fingers Zif(2) to Zif(12) may be fused to the leucine zipper domains of the 
Jun/Fos proteins, prototypical members of the bZIP &mily of proteins (O'Shea, et al. 
Science, 254:539, 1991). Alternatively, zinc finger proteins can be fiised to other proteins 
which are capable of forming heterodimers and contain dimerization domains. Such 
protdns will be known to those of ddll in &e art 

The Jun/Fos leucine zippers preferentially form heterodimers and allow for the 
recognition of 12 to 72 base pairs. Henceforth, Jun/Fos refer to the leucine zipper 
domains of Aese proteins. Zinc finger proteins are fiised to Jun, and independoitly to Fos 
15 by mediods commonly used in the art to link proteins. Following purification, the Zif-Jun 
and Zif-Fos constructs (FIGURE 13 and 14, respectively), the proteins are mixed to 
spontaneously form a Zif-Jun/Zif-Fos heterodimer. Alternatively, coexpression of the 
genes oicoding diese proteins results in the formation of Zif-Jun/Zif-Fos heterodimers in 
vivo. Fusion with an N-terminal nuclear localization sijgnal allows for targeting of 
20 expression to the nucleus (Calderon, et al. Cell, 41:499, 1982). Activation domains may 
also be incorporated into one or each of the leucine zipper fiision constructs to produce 
activators of transcription (Sadowski, et al.. Gene, 111:137, 1992). These dimeric 
constructs then allow for specific activation or repression of transcription. These 
heterodimeric Zif constructs are advantageous since they allow for recognition of 
25 palindromic sequences (if the fingers on boA Jun and Fos recognize the same DNA/RNA 
sequence) or extended asymmetric sequences (if the fingers on Jun and Fos recognize 
different DNA/RNA sequences). For example the palindromic sequence 



5 



10 
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5' - GGC CCA CGC jNl GCG TGG GCG - 3' 

3 ' - GCG GGT GCG W}^ CGC ACC CGC - 5 ' (SEQ ID NO: 37) 

is recognized by the ZiQ68-Fos/ZiQ68 Jun dimer (x is any number). The spacing 

between subsites is determined by the site of fiasion of Zif with the Jun or Fos zipper 

5 domains and the length oftheiinko- between the Zifand zipper domains. Subsite spacing 

is determined by a binding site selection method as is common to those skilled in the art 

(Thiesen, et al., Nucleic Acids Research, 14:3203, 1990). Example of the recognition of 

an extended asymmetric sequence is shown by Zif(C7) (-JunyZif-268-Fos dimer. This 

protein consists of 6 fingers of the C7 type (EXAMPLE 1 1) linked to Jun and three 

1 0 fingers of Zi068 linked to Fos, and recognizes the extended sequence: 

5' - CGC CGC CGC CGC CGC CGC |n|. GCG TGG GCG - 3 ' 
3' - GCG GCG GCG GCG GCG GCG InJx CGC ACC CGC - 5' 
(SEQ ID NO: 38) 



EXAMPLE 13 

15 rONSTRUCTION OF MULTIFINGER PROTEINS UTILIZING REPEATS 

OF THE FI RST FINGER OF ZIF268 

Following mutagenesis and selection of variants of ^ Zi£268 protein in vMdi the finger 
1 specificity or afBnity was modified (See EXAMPLE 7), proteins carrying multiple 
copies of the finger may be constructed using the TGEKP linker sequrace by methods 
20 known in the art For example, the C7 finger may be constructed according to ^ 
scheme: 

MKLLEPYACPVESCDRRFSKSADLKRHIRHIQEKE- 



25 



(YACPVESCDRRFSKSADLBCHIRIHIQEKE) wi, where the sequence of the last Unker 
is subject to change since it is at the terminus and not involved in linking two fingers 
together. An example of a three finger C7 construction is shown in Figure 15. This 
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protein binds the designed target sequence GCG-GCG-GCG (SEQ ID NO: 32) in the 
oligonucleotide haiipin CCT-CGC-CGC-CGC-GGG-TTT-TCC-CGC-GCC-CCC GAG 
G with an afBnity of 9nM, as compared to an affinity of 300 nM for an oligonucleotide 
encoding the GCG-TGG-GCG sequence (as determined by surface plasmon resonance 
5 studies). Proteins containing 2 to 1 2 copies of the C7 finger have been constructed and 
shown to have specificity for their predicted targets as determined by ELISA (see for 
example. Example 7). Fingers utilized need not be identical and may be mixed and 
matched to produce proteins wfaidi recognize a desired target sequence. Thesemayalso 
be utilized with leucine zippers (e.g., Fos/Jun) to produce proteins witti extended 
1 0 sequence recognition. 

In addition to producing polymers of finger 1 , the entire three finger ZiQ68 and modified 
versions herein may be fijsed using the consensus linker TGEKP to produce proteins widi 
extended recognition sites. For example, FIGURE 1 6 shows the sequence of the protein 
Zif268-ZiC268 in vAdch tiie natural protein has been fused to itself using the TGEKP 
15 Unker. This protein now binds the sequence GCG-TGG-GCG-GCG-TGG-GCG as 
demonstrated by ELISA. Thoefore modifications within the three fingers of Zif26S may 
be fused together to form a protein vMch recognizes extended sequences. These new 
anc proteins may also be used in combination with leucine zippers if desired, as 
described in Example 12. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: THE SCRIPPS RESEARCH INSTITUTE 

(ii) TITLE OF INVENTION: ZINC FINGER PROTEIN DERIVATIVES AND 
5 METHODS THEREFOR 

(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Spensley Horn Jubas & Lubitz 
10 (B) STREET: 1B80 Century Peirk Bast, Suite 500 

(C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 90067 

15 (v) CamUTER READABI^ FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTQ4: PC-DOS/MS-DOS 

(D) SOFTHARB: Patentin Release #1.0, Version #1.25 

20 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/nS95/ 

(B) FILING DATE: lB-JAN-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
25 (A) NAME: Halle, Lisa A. , Ph.D. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: FD-4083 

(Ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 619-455-5100 
30 (B) TELEFAX: 619-455-5110 



(2) INFORMATION FOR SEQ ID N0:1: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTB: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .25 

10 (D) OTHER INFORMATION: /note= "Where Tyr appears, Tyr can 

also be Phe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Tyr Xaa Cys Xaa Xaa Cys X£ia Xaa Xaa Phe Xaa Xaa Xaa Xaa Xeia Leu 
15 10 15 

15 Xaa Xcia His Xaa Xaa Xaa His Xaa Xaa 

20 25 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
25 (B) CLONE: ZP 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .36 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 



30 ATGAAACTGC TCGAGCCCTA TGCTTGCCCT GTCGAG 



36 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZR 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAGGAGGAGG AGACTAGTGT CCTTCTGTCT TAAATGGATT TTGGT 45 
15 (2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECOLE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: zi£268Xho-Spe 

(ix) FEATURE: 
25 (A) NAME/KEY: CDS 

(B) LOCATION: 1. .273 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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CTC GAG CCC TAT GCT TGC CCT GTC GAG TCC TGC GAT CGC CGC TTT TCT 48 
Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 
15 10 15 



CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC CAG AAG 96 
Arg Ser Asp Glu Leu Thr Arg His lie Arg He His Thr Gly Gin Lys 
20 25 30 



CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC AGT CGT AGT GAC CAC 144 
Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 
35 40 45 



10 CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT GCC TGT 192 

Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 
50 55 60 

GAC ATT TGT GGG AGG AAG TTT GCC AG6 AGT GAT GAA CGC AAG AGG CAT 240 
Asp He Cys Gly Arg Lys Phe Ala Azg Ser Asp Glu Arg Lye Arg His 
15 65 70 75 80 

ACC AAA ATC CAT TTA AGA CAG AAG GAC ACT AGT 273 
Thr Lys He His Leii Arg Gin Lys Asp Thr Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 5: 

20 (i) SEQUENCE CHASACTERISTICS : 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) HOLECOLB TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 
15 10 15 

Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly Gin Lys 
20 25 30 



30 Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 
35 40 45 
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Leu Thr Thr His lie Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 
50 55 60 

Asp lie Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 
65 70 75 80 

5 Thr Lys He His lieu Arg Gin I>ys Asp Thr Ser 

85 90 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE : 
15 (B) CLONE: FTX3 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
20 GCAATTAACC CTCACTAAA6 GG 22 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOTOCE: 
(B) CLONE: BZF3 
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(ix) FEATDRE: 

(A) NAME/KEY: CDS 

(B) liCXa^TION: 1..21 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
5 GGCAAACTTC CTCCCACAAA T 21 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQTJENCE CHARACTERISTICS: 

(A) LENQTH: 60 base pairs 

(B) TYPE: nucleic acid 
10 <C) STRANDEDNBSS : single 

(D) TOPOLOSY: linear 

(ii) HOI.ECDLE TYPE: DHA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF36K 

15 (ix) FEATDRE: 

(A) NAME/KEY: CDS 

(B) LOCATION: X..60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATTT6TGGGA QGAAGTTTGC CNNKAGTHNK NNKNNKNNKN NKCATACCAA AATCCATTTA 60 



20 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
25 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: R3B 

(ix) FBATDRE: 

(A) NAME/KEY: CDS 
5 (B) LOCATION: 1..21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTGATATTCA CAAACGAATG G 21 

(2) INFORMATION FOR SEQ ID NO: 10: 

(1) SBQXJENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 (vii) IMMEDIATE SOURCE: 

(B) CLONE: ZFNsi-B 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .21 



20 (3d.) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATGCATATT CGACACTGGA A 21 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF2r6F 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
5 (B) LOCATION: 1. .66 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CAC3TGTCGAA TATGCATGCG TAACTTCNNK NNKNNKNNKN NKNNKACCAC CCACATCCGC 60 



ACCCAC 66 
(2) INFORMATION FOR SEQ ID NO:12: 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: AFI6rb 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 1..66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



CTGGCCTGTG TGGATGCGGA TATOINNMNN MNNMNNMNNC GAMNNA6AAA A6CGGCGATC 60 
GCAGGA 66 
(2) INFORMATION FOR SEQ ID NO: 13: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) djONE: ZFIF 

(ix) FEATURE: 

(A) KAME/KEY: CDS 

(B) LOCATION: 1..24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATATCC6CA TCCACACAOG CCAG 24 
10 (2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Ser Asp Glu Leu Tbr Arg His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 cunino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



wo 95/19431 



PCT/US95/00829 



-97- 



(ii) NOLECOLE TYPE: peptide 



(ix) FBA.TORE: 

(A) NAME/KEY: Peptide 

(B) UiCRTiaS: 1. .6 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ser Arg Ser Asp His Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO:16: ^ 

(i) SBQI]ENCE CHARACTERISTICS: 
10 (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) HOLECDLE TYPE: DNA (genomic) 



15 (ix) FBATORB: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..34 



(xi) SEQTIENCE DESCRIPTION: SEQ ID NO: 16: 
CQTAAATGG6 C6CCCTTTTG GGCGCCCATT TACG 34 
20 (2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 



(ii) MOLECDLE TYPE: peptide 



wo 95/19431 



PCTAJS95/00829 



-98- 



(ix) PEATDEE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

5 Arg Ser Asp Glu Arg Lys Arg His 

1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTB: 8 amino acids 
10 (B) TYPE: amino acid 

(C) STSANDEDHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: peptide 



(ix) FEATURE: 
15 (A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 



(xi) SEQUENCE DESCRIPTI(»I : SEQ ID NO: 18: 

Trp Ser lie Pro Val Leu Leu His 
1 S 

20 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Trp Ser Leu Leu Pro Val Leu His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



(xi) SEQXnNCE DESCRIPTION: SEQ ID NO: 20: 

15 Phe Ser Phe Leu Leu Pro Leu His 

1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
25 (A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID N0:2i: 
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Leu Ser Thr Trp Arg Gly Trp His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



10 (ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Thr Ser He Gin Leu Pro Tyr His 
15 1 5 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
25 (B) LOCATION: 1. .61 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



TGATCTCAGA AGCCAAGCAG GGTC6GGCCT GGTTAGTACT TGGATGGGAG ACCGCCTGGG 60 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATDRE: 

(A) NAME/KEY: Peptide 
10 (B) LOCATIC»I: 1. .26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr He Cys Ser Phe Ala Asp Cys Gly Ala Ala Tyr Asn Lys Asn Trp 
15 10 15 

Lys Leu Oln Ala His Leu Cys Lys His Thr 
15 20 25 

(2) INFORMATION FOR SEQ ID NO: 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) XiBNGTH: 26 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 
25 (B) LOCATION: 1..26 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Phe Pro Cys Lys Glu Glu Gly Cys Glu Lys Gly Phe Thr Ser Leu His 
15 10 15 
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His Leu Thr Arg His Ser Leu Thr His Thr 
20 25 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRAHDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOIiECOLE TYPE: peptide 



10 (ix) FEATDRE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..26 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

Phe Thr Cys Asp Ser Asp Gly Cys Asp Leu Arg Phe Thr Thr Lys Ala 
15 1 5 10 15 

Asn Met Lys Lys His Phe Asn Arg Phe Bis 
20 25 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



25 (ix) FEATDRE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
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TGGATGGGAO ACC 13 
(2) INFOSMA.TION FOR SEQ ID NO: 28: 

(i) SBQUEKCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATORE: 
10 (A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



(xi) SEQDENCB DESCRIPTION: SEQ ID NO: 28: 

Arg Ser Asp Glu Arg Lys Arg His 
1 5 

15 (2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIOIESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DHA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..21 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 

GTCCATAASA TTAGCGGATC C 21 
(2) INFORMATION FOR SEQ ID N0:30: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DHA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
10 6TGA6CGA6G AA6C6GAAGA 6 21 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 1. .34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCTGCGTGGO CGCCCTTTTG GGCGCCCACG CAGG 34 



Although the invention has been described with reference to the presently preferred 
embodiment, it should be understood that various modifications can be made without 
25 departing from the spirit of the invention. Accordingly, the invention is limited only by 
Ae following claims. 
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CLAIMS 

1. An isolated zinc finger-nucleotide binding polypeptide variant comprising at 
least two zinc finger modules that bind to a cellular nucleotide sequence and 
modulate the fimction of the cellular nucleotide sequence. 



2. The variant of claim 1 , \^erein the modulation is enhancement of transcription 
of a gene opoatively linked to the cellular nucleotide sequence. 

3. The variant of claim 1, i^erein the modulation is suppression of transcription 
of a gene operatively linked to the cellular nucleotide sequence. 

4. The variant of claim I , \^ch is d^ved fiom a zmc finger-nucleotide binding 
polypeptide selected from the group consisting of zif 268 and TFIUA. 

5. The variant of claim 1 , wherein the cellular nucleotide sequence is DNA. 

6. The variant of claim 1 , wherein the cellular nucleotide sequraice is RNA. 

7. Thevariantofclaim l,^»4ierem1faepo!; peptidecontainsalinkerregionbetweai 
zinc fingers consisting essentially of the amino acid sequence TGEKP. 

8. Tlie variant of claim 1 , vtiierein die cellular nucleotide sequence is a structural 
gene nucleotide sequence. 

9. The variant of claim 1 , wherein the cellular nucleotide sequence is a promoter 
nucleotide sequence. 

10. The variant of claim 9, wherein the promoter is an onco-promoter. 
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1 1 . The variant of claim 1 0, wherein the promoter is a viral promoter. 

12. The variant of claim 1 , wherein the cellular nucleotide sequence is a retroviral 
nucleotide sequrace. 

13. The variant of claim 12, wiierein the retrovirus is a human T-cell lymphotrophic 
virus (HTLV). 

14. The variant of claim 13, wherein the retrovirus is HTLV-1 or HTLV-2. 

15. The variant of claim 12, vdierein the retrovirus is a human immimodeficiency 
virus (HIV). 

16. The variant of claim 15, wherein the retrovirus is HTV-l or HrV-2. 

1 7. The variant of claim 1 , wherein the cellular nucleotide sequence is an oncogene 
nucleotide sequence. 

18. The variant of claim 1, wherein die cellular nucleotide sequence is a plant 
cellular nucleotide sequence. 

19. A nucleotide sequence encoding a zinc finger-nucleotide binding polypeptide 
variant of claim 1. 

20. A recombinant «qiression vector containing the zinc finger-nucleotide binding 
polypeptide variant of claim 1 . 
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21. A pharmaceutical composition comprising a therapeutically effective amoimt of 
a zinc finger-nucleotide binding polypeptide derivative or a therapeutically 
effective amount of a nucleotide sequence which encodes a zinc finger- 
nucleotide binding polypeptide derivative, wherein the derivative binds to a 
cellular nucleotide sequoice to modulate the fimction of the cellular nucleotide 
sequence, in combination with a phaimaceutically acceptable carrier. 

22. The pharmaceutical composition of claim 21, v^erein the modulation is 
enhancemoit of transcription of a gene operatively linked to the cellular 
nucleotide sequence. 

23. The pharmaceutical composition of claim 21, ^x^ierein the modulation is 
suppression of transcription of a gene operatively linked to tiie cellular 
nucleotide sequ^ice. 

24. The pharmaceutical composition of claim 21, herein the zinc finger-nucleotide 
binding polypeptide derivative is a truncated wild-type zinc finger-nucleotide 
binding domain. 

25. The pharmaceutical composition of claim 21, wherein the zinc finger binding 
polypqjtide derivative is a variant polypqitide. 

26. A melhod for iiihibiting a ceUular nucleotide sequence conqnisiiig a ancfiiig^'- 
nucleotide binding motif^ the method comprising contacting the motif with an 
effective amount of a anc finger-nucleotide binding polypeptide derivative 
v^ch binds the motif. 



27. 



The method of claim 26, v^erein the zinc finger binding polypeptide derivative 
is a truncated zinc finger protein. 
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28. The method of claim 26, wherein the zinc finger polypeptide derivative is a 
variant polypeptide. 

29. The method of claim 26, >vherein the cellular nucleotide sequence is DNA. 

30. The method of claim 26, wherein the cellular niicleotide sequence is RNA. 

3 1 . The method of claim 26, vt^ierein tbe cellular nucleotide sequence is a structural 
gene nucleotide sequence. 

32. Hie method ofclaim 26, AK^ierein the cellular nucleotide sequence is a promoter 
nucleotide sequrace. 

33. The method of claim 26, wherein the cellular nucleotide sequence is an 
oncogene nucleotide sequence. 

34. The method of claim 26, wherein the cellular nucleotide sequence is a plant 
cellular nucleotide sequence. 

35. The method ofclaim 3 l,vAaein the zinc finger binding polypqjtide derivative 
is a variant polypeptide. 



wo 95/19431 



PCTAJS95/00829 



-109- 



36. A method for obtaining an isolated zinc finger-nucleotide binding polypeptide 
variant vMch binds to a cellular nucleotide sequence comprising: 

a) identifying the amino acids in a zinc finger-nucleotide binding 
polypeptide that bind to a first cellular nucleotide sequence and modulate 

5 the function of the nucleotide sequence; 

b) creating an e:q>ression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified in step 
a) above; 

c) expressing the library in a suitable host cell; and 

10 d) isolating a clone that produces a polypeptide variant that binds to a 

second cellular nucleotide sequence and modulates the function of tfie 
second nucleotide sequence. 

37. The method of claim 36, vt^erein the library is expressed in a phage sur&ce 
ejqpression system. 

38. The method of claim 36, \(iierein the phage esqnession system includes a 
reducing reagent vMch allows folding of expression products on the phage 
sur&ce. 

39. The method of claim 38, wherein the reducing reagent is dilhiotfareitol. 

40. The m^od of claim 36, v^erein the library is randomized by PCR using 
primers containing degenoate triplet codons at sequence locations 
corresponding to the determined amino acids. 



41. 



The method of claim 36, v^erein the modulation of fimction is rahancement of 
transcription of a gene operatively linked to the cellular nucleotide sequence. 
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42. The method of claim 36, wherein the modiilation of function is suppression of 
transcription of a gene operatively linked to the cellular nucleotide sequence. 

43. The method of claim 36, wherein the cellular nucleotide sequence is DNA. 

44. The method of claim 36, wherein the cellular nucleotide sequence is RNA. 

45. A anc finger-nucleotide binding polypeptide variant produced by the method 
of claim 36. 



46. A method of treating a subject with a cell proliferative disorder, wherein the 
disordrar is associated with the modulation of a celliilar nucleotide sequence 
associated wift a zinc fingo'-nucleotide binding motif, comprising contacting 
the zinc iBnger-nucleotide binding motif with an effective amoimt of a zinc 
finger-nucleotide binding polypeptide derivative that binds to the zinc finger- 
nucleotide binding motif to modulate activity of the cellular nucleotide 
seqxience. 

47. The method of claim 46, v^erein an raqnession vector comprising a 
polynucleotide sequmce oicoding a zinc finger-nucleotide binding polypeptide 
derivative is introduced into the cells of tiie subject 

48. The method of claim 47, herein the ejqnression vector is a virus. 

49. The m^od of claim 46, i^erein the modulation is eiohancement of transoiption 
of a gene operatively linked to the cellular nucleotide sequence. 



50. 



The method of claim 46, >^erein the modulation is siqjpression of transcription 
of a gene operatively linked to the cellular nucleotide sequence. 
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51. A method for identifying a protein which modulates the fimction of a cellular 
nucleotide sequence and binds to a zinc finger-nucleotide binding motif 
comprising: 

a) incubating con^nents comprising a nucleotide sequence encoding the 
putative protein operably linked to a first inducible promoter, and a 
reporter gene operably linked to a second inducible promoter and a zinc 
finger-nucleotide binding motif, v^erein the incubating is carried out 
imder conditions sufGcient to allow the components to hatetact; and 

b) measuring fte effect ofthe putative modulating protein on the ejqnession 
of the reporter gene. 

52. The method of claim 51, \stoein the modulation is inhibition of gene 
esqnession. 

53. The method of claim 51, vt^ierein the modulation is enhancement of gene 
ejqiressiorL 

54. The method of claim 5 1 , v^erein the first inducible promoter is the arabinase 
promoter. 

55. The method of claim 51 , \**erein the second inducible promoter is the lactose 
promoter. 

56. The method ofclaim 51, wherein fte incubating is pofonned in vz/ro. 

57. The method of claim 51 , \n*erein the incubating is performed in vivo. 

58. The method of claim 5 1 , \n4ierein the reporter gene is p-galactosidase. 
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